End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0689
 ## Model description
@@ -52,17 +52,17 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.6675        | 2.86  | 20   | 0.5153          |
-| 0.1621        | 5.71  | 40   | 0.1178          |
-| 0.0369        | 8.57  | 60   | 0.0707          |
-| 0.0179        | 11.43 | 80   | 0.0652          |
-| 0.0138        | 14.29 | 100  | 0.0689          |
 ### Framework versions
-- PEFT 0.8.2
 - Transformers 4.37.0
 - Pytorch 2.1.2
-- Datasets 2.17.1
 - Tokenizers 0.15.1

 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) on the generator dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.0141
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 1.0245        | 1.25  | 20   | 0.9835          |
+| 0.833         | 2.5   | 40   | 0.9249          |
+| 0.7451        | 3.75  | 60   | 0.9172          |
+| 0.6741        | 5.0   | 80   | 0.9335          |
+| 0.5957        | 6.25  | 100  | 1.0141          |
 ### Framework versions
+- PEFT 0.9.0
 - Transformers 4.37.0
 - Pytorch 2.1.2
+- Datasets 2.18.0
 - Tokenizers 0.15.1

adapter_config.json CHANGED Viewed

@@ -23,5 +23,6 @@
     "q_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_rslora": false
 }

     "q_proj"
   ],
   "task_type": "CAUSAL_LM",
+  "use_dora": false,
   "use_rslora": false
 }

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2a70529a6c16f35ba46dbc14d500e94fd67e85e5678d9913a10b9e51bb611f97
 size 109069176

 version https://git-lfs.github.com/spec/v1
+oid sha256:f25cb8bc11ff3405d94d8ef0da21908bffc976d06bae27d352da056451175a0d
 size 109069176

runs/Mar11_07-10-25_3ac72f125fd6/events.out.tfevents.1710141065.3ac72f125fd6.312.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:12e255bdf9c69ecfd6d20912348e15933779ceaaefcae679ba45b6ab34ee7bea
+size 8015

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ffbee34eee499f41d4ea6ac62e262fae9bf0789c97a5f032d204aa261ad629d3
 size 4728

 version https://git-lfs.github.com/spec/v1
+oid sha256:5fd16ac66f553d6c601e2f5a75b1c9a9ef7e8fb05141fcba9499d58d44d98fef
 size 4728