Commit fdbc274 by youralien · verified · 1 parent: eadf306

End of training

  1. README.md +30 -8
README.md CHANGED
@@ -14,12 +14,32 @@ model-index:
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->
 
  # ModernBERT-Questions-goodareas-classifier
 
  This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.8440
- - F1: 0.8469
 
  ## Model description
 
@@ -38,21 +58,23 @@ More information needed
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
- - learning_rate: 1e-06
- - train_batch_size: 32
  - eval_batch_size: 16
  - seed: 42
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
- - num_epochs: 3
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss | F1 |
  |:-------------:|:-----:|:----:|:---------------:|:------:|
- | 0.013 | 1.0 | 231 | 0.8559 | 0.8480 |
- | 0.0086 | 2.0 | 462 | 0.8776 | 0.8432 |
- | 0.0081 | 3.0 | 693 | 0.8440 | 0.8469 |
 
 
  ### Framework versions
 
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
  should probably proofread and complete it, then remove this comment. -->
 
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ryanlouie2021-stanford-university/modernbert-Reflections-goodareas-sweeps/runs/3u6o9mcs)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ryanlouie2021-stanford-university/modernbert-Reflections-goodareas-sweeps/runs/ver7zql6)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ryanlouie2021-stanford-university/modernbert-Reflections-goodareas-sweeps/runs/bgkconer)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ryanlouie2021-stanford-university/modernbert-Reflections-goodareas-sweeps/runs/ginadzp8)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ryanlouie2021-stanford-university/modernbert-Reflections-goodareas-sweeps/runs/6ofwzhfk)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ryanlouie2021-stanford-university/modernbert-Reflections-goodareas-sweeps/runs/bfld45b3)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ryanlouie2021-stanford-university/modernbert-Reflections-goodareas-sweeps/runs/mjq9j1y2)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ryanlouie2021-stanford-university/modernbert-Reflections-goodareas-sweeps/runs/ppaajpfs)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ryanlouie2021-stanford-university/modernbert-Reflections-goodareas-sweeps/runs/f4splclm)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ryanlouie2021-stanford-university/modernbert-Reflections-goodareas-sweeps/runs/0iqpwizp)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ryanlouie2021-stanford-university/modernbert-Reflections-goodareas-sweeps/runs/re0dfpi3)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ryanlouie2021-stanford-university/modernbert-Reflections-goodareas-sweeps/runs/1qnrtp6c)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ryanlouie2021-stanford-university/modernbert-Reflections-goodareas-sweeps/runs/vhi1pheu)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ryanlouie2021-stanford-university/modernbert-Reflections-goodareas-sweeps/runs/a6s13itb)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ryanlouie2021-stanford-university/modernbert-Reflections-goodareas-sweeps/runs/pxufas3e)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ryanlouie2021-stanford-university/modernbert-Reflections-goodareas-sweeps/runs/hgqk1sp5)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ryanlouie2021-stanford-university/modernbert-Reflections-goodareas-sweeps/runs/hutjz5n2)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ryanlouie2021-stanford-university/modernbert-Reflections-goodareas-sweeps/runs/rd0hak7n)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ryanlouie2021-stanford-university/modernbert-Reflections-goodareas-sweeps/runs/0g9pz5x1)
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ryanlouie2021-stanford-university/modernbert-Reflections-goodareas-sweeps/runs/noplgwdw)
  # ModernBERT-Questions-goodareas-classifier
 
  This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
  It achieves the following results on the evaluation set:
+ - Loss: 2.3572
+ - F1: 0.8616
 
  ## Model description
 
 
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
+ - learning_rate: 7e-05
+ - train_batch_size: 16
  - eval_batch_size: 16
  - seed: 42
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
+ - num_epochs: 5
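
The `+` lines above raise `learning_rate` to 7e-05 and `num_epochs` to 5 while keeping `lr_scheduler_type: linear`. As a rough illustration of what that schedule implies, the sketch below computes the learning rate at a given optimizer step under linear decay to zero. It assumes zero warmup steps (no warmup setting appears in the list) and takes the total of 2305 steps from the updated results table; `linear_lr` is a hypothetical helper, not part of any library.

```python
# Hypothetical helper: linear learning-rate decay with no warmup.
# Zero warmup is an assumption; the card does not list a warmup setting.

PEAK_LR = 7e-05      # learning_rate from the updated card
TOTAL_STEPS = 2305   # final step in the updated results table (5 epochs x 461 steps)


def linear_lr(step: int, peak_lr: float = PEAK_LR, total_steps: int = TOTAL_STEPS) -> float:
    """Return the learning rate at `step` under linear decay from peak_lr to zero."""
    remaining = max(0, total_steps - step)
    return peak_lr * remaining / total_steps


if __name__ == "__main__":
    # Print the learning rate at the start of training and at each epoch boundary.
    for epoch in range(6):
        step = epoch * 461
        print(f"epoch {epoch}: step {step:4d}, lr {linear_lr(step):.2e}")
```

Under these assumptions the rate falls by one fifth of its peak per epoch and reaches zero at step 2305.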
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss | F1 |
  |:-------------:|:-----:|:----:|:---------------:|:------:|
+ | 0.4695 | 1.0 | 461 | 0.4532 | 0.8288 |
+ | 0.4062 | 2.0 | 922 | 0.4762 | 0.8350 |
+ | 0.2417 | 3.0 | 1383 | 0.6892 | 0.8537 |
+ | 0.116 | 4.0 | 1844 | 2.0905 | 0.8611 |
+ | 0.0245 | 5.0 | 2305 | 2.3572 | 0.8616 |
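
In the updated rows, F1 keeps rising through epoch 5 while validation loss climbs from 0.4532 to 2.3572, so the "best" epoch depends on which metric is used for selection. The sketch below copies the five added rows into plain Python and picks the best epoch under each criterion. It also estimates an upper bound on the training-set size from steps per epoch, assuming a single device and no gradient accumulation (neither is stated in the card).

```python
# Rows copied from the updated "Training results" table.
ROWS = [
    {"epoch": 1, "step": 461, "train_loss": 0.4695, "val_loss": 0.4532, "f1": 0.8288},
    {"epoch": 2, "step": 922, "train_loss": 0.4062, "val_loss": 0.4762, "f1": 0.8350},
    {"epoch": 3, "step": 1383, "train_loss": 0.2417, "val_loss": 0.6892, "f1": 0.8537},
    {"epoch": 4, "step": 1844, "train_loss": 0.116, "val_loss": 2.0905, "f1": 0.8611},
    {"epoch": 5, "step": 2305, "train_loss": 0.0245, "val_loss": 2.3572, "f1": 0.8616},
]
TRAIN_BATCH_SIZE = 16  # train_batch_size from the updated card

# Select the best epoch under each criterion.
best_by_f1 = max(ROWS, key=lambda r: r["f1"])
best_by_val_loss = min(ROWS, key=lambda r: r["val_loss"])

# Upper bound on training examples: steps per epoch times batch size.
# Assumes one device and no gradient accumulation; the last batch may be partial.
steps_per_epoch = ROWS[0]["step"]
max_train_examples = steps_per_epoch * TRAIN_BATCH_SIZE

if __name__ == "__main__":
    print("best epoch by F1:", best_by_f1["epoch"])
    print("best epoch by validation loss:", best_by_val_loss["epoch"])
    print("training examples (upper bound):", max_train_examples)
```

The two criteria disagree here (epoch 5 by F1, epoch 1 by validation loss), which is worth keeping in mind when reading the headline `Loss: 2.3572` and `F1: 0.8616` figures together.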
 
 
  ### Framework versions