End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.6037
 ## Model description
@@ -43,15 +43,18 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
-- num_epochs: 3
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.7989        | 1.0   | 576  | 1.6138          |
-| 1.5588        | 2.0   | 1152 | 1.6049          |
-| 1.3794        | 3.0   | 1728 | 1.6037          |
 ### Framework versions

 This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.3632
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
+- num_epochs: 6
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 1.3736        | 1.0   | 179  | 1.3915          |
+| 1.4252        | 2.0   | 358  | 1.3709          |
+| 1.2687        | 3.0   | 537  | 1.3638          |
+| 1.414         | 4.0   | 716  | 1.3626          |
+| 1.3064        | 5.0   | 895  | 1.3631          |
+| 1.2414        | 6.0   | 1074 | 1.3632          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -20,9 +20,9 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "k_proj",
     "o_proj",
-    "q_proj",
     "v_proj"
   ],
   "task_type": "CAUSAL_LM",

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "q_proj",
     "k_proj",
     "o_proj",
     "v_proj"
   ],
   "task_type": "CAUSAL_LM",

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:aa72659c59bfdaf6f4c050fe95644668648a7f281255c5e293684424b5247b01
 size 109086672

 version https://git-lfs.github.com/spec/v1
+oid sha256:b48d41e2f183861d9ffe8f2c349cae94a0637fb1ce0c03629316c4a682c48aaa
 size 109086672

tokenizer.json CHANGED Viewed

@@ -1,11 +1,6 @@
 {
   "version": "1.0",
-  "truncation": {
-    "direction": "Right",
-    "max_length": 2048,
-    "strategy": "LongestFirst",
-    "stride": 0
-  },
   "padding": null,
   "added_tokens": [
     {

 {
   "version": "1.0",
+  "truncation": null,
   "padding": null,
   "added_tokens": [
     {

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ad118be907541be3a0929d88a31a1787801d0eeb812aac3f0caa0b7b53072ee8
 size 5496

 version https://git-lfs.github.com/spec/v1
+oid sha256:6cb640582263a907080612c0893444a3f752219ce1c3aaffd6e493a4a21fa0be
 size 5496