mistral-2nd

Files changed (5) hide show

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.9941
 ## Model description
@@ -52,11 +52,11 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.0455        | 1.25  | 20   | 0.9839          |
-| 0.8594        | 2.5   | 40   | 0.9259          |
-| 0.7674        | 3.75  | 60   | 0.9125          |
-| 0.6763        | 5.0   | 80   | 0.9213          |
-| 0.5779        | 6.25  | 100  | 0.9941          |
 ### Framework versions

 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) on the generator dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.9820
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 1.0672        | 1.25  | 20   | 0.9785          |
+| 0.8414        | 2.5   | 40   | 0.9222          |
+| 0.7461        | 3.75  | 60   | 0.9236          |
+| 0.6702        | 5.0   | 80   | 0.9316          |
+| 0.5844        | 6.25  | 100  | 0.9820          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -19,8 +19,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "v_proj",
-    "q_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "q_proj",
+    "v_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:63d1b3fef86af3898d8f023f28c5025731427004b29bbb8f9ba48baa9c6e171d
 size 109069176

 version https://git-lfs.github.com/spec/v1
+oid sha256:59b074adc3e9c2915cb49fbf7eaff5c2f5eafd0977c66cccf898ede5c7eb387f
 size 109069176

runs/Mar14_06-53-09_8422cc58f504/events.out.tfevents.1710399202.8422cc58f504.94.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:7932a483158c3202f61b8ab278b3d888a3a3de06172937f97d0f60255da56949
+size 8015

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4c3dc62a01c9d3d21ba3a250c8f738e32c7dd05d5ac555c4b04c4c4cb7c0d316
 size 4728

 version https://git-lfs.github.com/spec/v1
+oid sha256:57a62102659514689c773228ea8071a50f7af6375251a8225e6fc9c002f602ed
 size 4728