BigTMiami commited on
Commit
40aaaff
·
verified ·
1 Parent(s): c00e63c

End of training

Browse files
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 1.5492
19
 
20
  ## Model description
21
 
@@ -35,11 +35,11 @@ More information needed
35
 
36
  The following hyperparameters were used during training:
37
  - learning_rate: 0.0005
38
- - train_batch_size: 32
39
- - eval_batch_size: 32
40
  - seed: 42
41
- - gradient_accumulation_steps: 11
42
- - total_train_batch_size: 352
43
  - optimizer: Adam with betas=(0.9,0.98) and epsilon=1e-06
44
  - lr_scheduler_type: linear
45
  - lr_scheduler_warmup_ratio: 0.06
@@ -49,14 +49,12 @@ The following hyperparameters were used during training:
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:-----:|:----:|:---------------:|
52
- | 1.93 | 0.97 | 27 | 1.6121 |
53
- | 1.6683 | 1.98 | 55 | 1.5535 |
54
- | 1.6252 | 2.99 | 83 | 1.5258 |
55
- | 1.6651 | 3.97 | 110 | 1.5424 |
56
- | 1.6085 | 4.98 | 138 | 1.5716 |
57
- | 1.6078 | 5.99 | 166 | 1.5710 |
58
- | 1.6158 | 7.0 | 194 | 1.5807 |
59
- | 1.6491 | 7.97 | 221 | 1.5848 |
60
 
61
 
62
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 1.5970
19
 
20
  ## Model description
21
 
 
35
 
36
  The following hyperparameters were used during training:
37
  - learning_rate: 0.0005
38
+ - train_batch_size: 21
39
+ - eval_batch_size: 21
40
  - seed: 42
41
+ - gradient_accumulation_steps: 2
42
+ - total_train_batch_size: 42
43
  - optimizer: Adam with betas=(0.9,0.98) and epsilon=1e-06
44
  - lr_scheduler_type: linear
45
  - lr_scheduler_warmup_ratio: 0.06
 
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:-----:|:----:|:---------------:|
52
+ | 1.7917 | 1.0 | 232 | 1.5959 |
53
+ | 1.7109 | 2.0 | 465 | 1.6216 |
54
+ | 1.7571 | 3.0 | 697 | 1.6839 |
55
+ | 1.8098 | 4.0 | 930 | 1.7498 |
56
+ | 1.9035 | 5.0 | 1162 | 1.8368 |
57
+ | 1.9617 | 6.0 | 1395 | 1.9273 |
 
 
58
 
59
 
60
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:06e752298faeb7db05bcaabe89a32a756a27e99b1a3cfd8d11f49aaed472f5b2
3
  size 498813948
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:02ff596dc4f8ca4004dc31a28166909afb72b836e9ac5845ba74bf3b001338c3
3
  size 498813948
runs/Apr16_13-28-55_3749cd13d26a/events.out.tfevents.1713274137.3749cd13d26a.245.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f13f38151169e06cb64a3b10d965d7ca6391938d2e6f39a374939fc40375031a
3
- size 7554
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8ee412ca9d4b9f72aca8cc505a611dbfe520a3200b4ef297af11509dcabb56d6
3
+ size 7908
runs/Apr16_13-28-55_3749cd13d26a/events.out.tfevents.1713274972.3749cd13d26a.245.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:49455b7f8ceb4849a3ff703617c80caf6dc31516ffbac0a3071bf75b3446bdce
3
+ size 359