mistral-0.2-fp

Files changed (5) hide show

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.8049
 ## Model description
@@ -52,11 +52,11 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.0509        | 0.87  | 20   | 0.8995          |
-| 0.8839        | 1.74  | 40   | 0.8167          |
-| 0.8105        | 2.61  | 60   | 0.7898          |
-| 0.7148        | 3.48  | 80   | 0.7915          |
-| 0.6221        | 4.35  | 100  | 0.8049          |
 ### Framework versions

 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the generator dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.9395
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 1.1508        | 0.67  | 20   | 1.0891          |
+| 0.9774        | 1.33  | 40   | 1.0043          |
+| 0.9105        | 2.0   | 60   | 0.9710          |
+| 0.8641        | 2.67  | 80   | 0.9539          |
+| 0.8492        | 3.33  | 100  | 0.9395          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -20,8 +20,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "q_proj",
-    "v_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "v_proj",
+    "q_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bff8112d95bc932ce88e61a9b6fb54a3b33ddc96ca50e1ead366158968c8cfbd
 size 109069176

 version https://git-lfs.github.com/spec/v1
+oid sha256:98956b458582df16dab7f2c373aeb60f751b6d8ba2cc240329a1f83086e84bed
 size 109069176

runs/Apr10_13-33-10_ab5a3d800f7a/events.out.tfevents.1712756003.ab5a3d800f7a.89.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:1c23f3506a5236da3bb3eb3f1aeb7e77cbbedbafb6a7fadd86a64caa23dd0f48
+size 8813

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f59623b10aaf9dc82e1c5c843c17496f95721a4649edec88c7620ffaeadbe4b9
 size 4920

 version https://git-lfs.github.com/spec/v1
+oid sha256:d466f979464643ae76daa15df67a23c2a12ca3b700a1dd21b2ed43f17dc6cecb
 size 4920