CLASS-MATE
/

BERT-MLM

Generated from Trainer

Model card Files Files and versions

Metrics Training metrics Community

AmalNlal commited on Mar 6, 2024

Commit

25f802a

·

verified ·

1 Parent(s): f23d610

End of training

Files changed (1) hide show

README.md +8 -8

README.md CHANGED Viewed

@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 5.6008
 ## Model description
@@ -33,8 +33,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 0.01
-- train_batch_size: 32
-- eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -44,11 +44,11 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 5.8219        | 0.12  | 100  | 5.6243          |
-| 5.6177        | 0.25  | 200  | 5.6153          |
-| 5.6121        | 0.37  | 300  | 5.6094          |
-| 5.6131        | 0.49  | 400  | 5.6055          |
-| 5.6           | 0.62  | 500  | 5.6008          |
 ### Framework versions

 This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 7.9301
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 0.01
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 9.2035        | 0.06  | 100  | 8.1180          |
+| 8.0969        | 0.12  | 200  | 8.0501          |
+| 8.002         | 0.19  | 300  | 8.0142          |
+| 8.0026        | 0.25  | 400  | 7.9677          |
+| 7.9075        | 0.31  | 500  | 7.9301          |
 ### Framework versions