End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-v0.3](https://huggingface.co/mistralai/Mistral-7B-v0.3) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.3306
 ## Model description
@@ -39,7 +39,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0005
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
@@ -51,12 +51,7 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 1.4488        | 0.3333 | 10   | 1.3641          |
-| 1.3617        | 0.6667 | 20   | 1.3351          |
-| 1.2343        | 1.0    | 30   | 1.3294          |
-| 1.2651        | 1.3333 | 40   | 1.3317          |
-| 1.1214        | 1.6667 | 50   | 1.3311          |
-| 1.1861        | 2.0    | 60   | 1.3306          |
 ### Framework versions
@@ -64,5 +59,5 @@ The following hyperparameters were used during training:
 - PEFT 0.11.1
 - Transformers 4.40.1
 - Pytorch 2.3.1+cu121
-- Datasets 4.0.0
 - Tokenizers 0.19.1

 This model is a fine-tuned version of [mistralai/Mistral-7B-v0.3](https://huggingface.co/mistralai/Mistral-7B-v0.3) on the generator dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.2867
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 2e-05
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 1.3323        | 1.8519 | 200  | 1.2867          |
 ### Framework versions
 - PEFT 0.11.1
 - Transformers 4.40.1
 - Pytorch 2.3.1+cu121
+- Datasets 4.1.1
 - Tokenizers 0.19.1

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:42826e48bf001b2293f2c5c72981e86ba008602cf4571a1419762bfaa445406d
 size 13648432

 version https://git-lfs.github.com/spec/v1
+oid sha256:131e9879d6649fa607d512256e9a8d6ca401da1dabf00eb0e82c50fda0a56296
 size 13648432

runs/Oct07_08-24-29_7cb473712ebb/events.out.tfevents.1759825483.7cb473712ebb.8026.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:5b1213480b27cd038916ca4df19746f3dbfd9841ef052fe7e3f4bc70d01e8a60
+size 5433

runs/Oct07_08-33-19_7cb473712ebb/events.out.tfevents.1759826001.7cb473712ebb.8026.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:06fe415713b9fda84c30a68c43b2c420b6de90ce8b64a1adb3ab51cd59dfdfa1
+size 6269

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f6f5ca2da79a90712e6af5e470e79f62bab71f9101fce7e065c6734966efd395
 size 5240

 version https://git-lfs.github.com/spec/v1
+oid sha256:e57960b4403e8409c0e89eb7cc0577ce4b7cf0841edeb44ee2b5fa7b9a44a91e
 size 5240