1em0n
/

results

Text Generation

Generated from Trainer

Model card Files Files and versions

1em0n commited on Sep 11, 2024

Commit

14e04ee

·

verified ·

1 Parent(s): 82617bd

Model save

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [ridger/MMfreeLM-370M](https://huggingface.co/ridger/MMfreeLM-370M) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.8998
 ## Model description
@@ -34,7 +34,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0001
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
@@ -49,7 +49,7 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 2.7555        | 2.7982 | 5000 | 2.9058          |
 ### Framework versions

 This model is a fine-tuned version of [ridger/MMfreeLM-370M](https://huggingface.co/ridger/MMfreeLM-370M) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.2207
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.004
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 3.2727        | 2.7982 | 5000 | 3.4141          |
 ### Framework versions