End of training

Files changed (4)

README.md CHANGED

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.5492
 ## Model description
@@ -35,11 +35,11 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 0.0005
-- train_batch_size: 32
-- eval_batch_size: 32
 - seed: 42
-- gradient_accumulation_steps: 11
-- total_train_batch_size: 352
 - optimizer: Adam with betas=(0.9,0.98) and epsilon=1e-06
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.06
@@ -49,14 +49,12 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.93          | 0.97  | 27   | 1.6121          |
-| 1.6683        | 1.98  | 55   | 1.5535          |
-| 1.6252        | 2.99  | 83   | 1.5258          |
-| 1.6651        | 3.97  | 110  | 1.5424          |
-| 1.6085        | 4.98  | 138  | 1.5716          |
-| 1.6078        | 5.99  | 166  | 1.5710          |
-| 1.6158        | 7.0   | 194  | 1.5807          |
-| 1.6491        | 7.97  | 221  | 1.5848          |
 ### Framework versions

 This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.5970
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 0.0005
+- train_batch_size: 21
+- eval_batch_size: 21
 - seed: 42
+- gradient_accumulation_steps: 2
+- total_train_batch_size: 42
 - optimizer: Adam with betas=(0.9,0.98) and epsilon=1e-06
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.06
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 1.7917        | 1.0   | 232  | 1.5959          |
+| 1.7109        | 2.0   | 465  | 1.6216          |
+| 1.7571        | 3.0   | 697  | 1.6839          |
+| 1.8098        | 4.0   | 930  | 1.7498          |
+| 1.9035        | 5.0   | 1162 | 1.8368          |
+| 1.9617        | 6.0   | 1395 | 1.9273          |
 ### Framework versions
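
The `total_train_batch_size` values on both sides of this diff are consistent with the per-device batch size multiplied by `gradient_accumulation_steps`. A minimal sketch of that arithmetic follows, assuming a single training device (the model card does not state the device count). The helper name `effective_batch_size` is hypothetical and not part of any library. The sketch also scans the new results table, copied from the added rows above, for the row with the lowest validation loss.

```python
# Hypothetical helper: not a Transformers API, just the arithmetic behind
# the total_train_batch_size field in the model card.
def effective_batch_size(per_device, accumulation_steps, num_devices=1):
    # num_devices=1 is an assumption; the README does not report it.
    return per_device * accumulation_steps * num_devices

old_total = effective_batch_size(32, 11)  # removed side of the diff
new_total = effective_batch_size(21, 2)   # added side of the diff

# (epoch, step, validation_loss) rows copied from the added table.
new_run = [
    (1.0, 232, 1.5959),
    (2.0, 465, 1.6216),
    (3.0, 697, 1.6839),
    (4.0, 930, 1.7498),
    (5.0, 1162, 1.8368),
    (6.0, 1395, 1.9273),
]

# Pick the row with the smallest validation loss.
best_epoch, best_step, best_loss = min(new_run, key=lambda row: row[2])

print(old_total, new_total)
print(best_epoch, best_step, best_loss)
```

In the added table the validation loss rises after the first epoch, so the minimum falls on the first logged row.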

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:06e752298faeb7db05bcaabe89a32a756a27e99b1a3cfd8d11f49aaed472f5b2
 size 498813948

 version https://git-lfs.github.com/spec/v1
+oid sha256:02ff596dc4f8ca4004dc31a28166909afb72b836e9ac5845ba74bf3b001338c3
 size 498813948
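
The `model.safetensors` entry is a Git LFS pointer rather than the weights themselves: three `key value` lines giving the spec version, the object ID, and the size in bytes. A minimal sketch of reading such a pointer follows. The function name `parse_lfs_pointer` is hypothetical, and the parser only handles the three keys visible in this diff.

```python
# Hypothetical parser for the three-line Git LFS pointer shown in the diff.
def parse_lfs_pointer(text):
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", separated by one space.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }

# Added side of the model.safetensors pointer, copied from the diff.
new_pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:02ff596dc4f8ca4004dc31a28166909afb72b836e9ac5845ba74bf3b001338c3
size 498813948
"""

info = parse_lfs_pointer(new_pointer)
print(info["hash_algo"], info["size"])
```

The size is the same on both sides of the diff (498813948 bytes) while the digest differs, so the file contents changed but the file length did not.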

runs/Apr16_13-28-55_3749cd13d26a/events.out.tfevents.1713274137.3749cd13d26a.245.0 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f13f38151169e06cb64a3b10d965d7ca6391938d2e6f39a374939fc40375031a
-size 7554

 version https://git-lfs.github.com/spec/v1
+oid sha256:8ee412ca9d4b9f72aca8cc505a611dbfe520a3200b4ef297af11509dcabb56d6
+size 7908

runs/Apr16_13-28-55_3749cd13d26a/events.out.tfevents.1713274972.3749cd13d26a.245.1 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:49455b7f8ceb4849a3ff703617c80caf6dc31516ffbac0a3071bf75b3446bdce
+size 359
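
A downloaded file can be checked against its pointer by comparing its byte length and SHA-256 digest with the `size` and `oid` fields. The sketch below assumes the file is available locally. The function name `matches_pointer` is hypothetical, and the demo uses stand-in bytes in a temporary file, not the real event file from this commit.

```python
import hashlib
import os
import tempfile

# Hypothetical check: does a local file match a pointer's size and sha256 oid?
def matches_pointer(path, expected_digest, expected_size, chunk_size=1 << 20):
    # Cheap check first: the byte length must equal the pointer's size field.
    if os.path.getsize(path) != expected_size:
        return False
    sha = hashlib.sha256()
    with open(path, "rb") as fh:
        # Hash in chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            sha.update(chunk)
    return sha.hexdigest() == expected_digest

# Demo with stand-in data (an assumption for illustration only).
payload = b"stand-in bytes, not the real event file"
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(payload)
demo_path = tmp.name

demo_ok = matches_pointer(demo_path, hashlib.sha256(payload).hexdigest(), len(payload))
demo_bad = matches_pointer(demo_path, "0" * 64, len(payload))
os.remove(demo_path)

print(demo_ok, demo_bad)
```

For the added event file, the expected values would be the digest on the `oid` line above and a size of 359 bytes.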