Mohitcr1 committed on
Commit 83337b5 · verified · 1 Parent(s): c5a1f88

mistral-2nd

README.md CHANGED
@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.9941
+- Loss: 0.9820
 
 ## Model description
 
@@ -52,11 +52,11 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.0455 | 1.25 | 20 | 0.9839 |
-| 0.8594 | 2.5 | 40 | 0.9259 |
-| 0.7674 | 3.75 | 60 | 0.9125 |
-| 0.6763 | 5.0 | 80 | 0.9213 |
-| 0.5779 | 6.25 | 100 | 0.9941 |
+| 1.0672 | 1.25 | 20 | 0.9785 |
+| 0.8414 | 2.5 | 40 | 0.9222 |
+| 0.7461 | 3.75 | 60 | 0.9236 |
+| 0.6702 | 5.0 | 80 | 0.9316 |
+| 0.5844 | 6.25 | 100 | 0.9820 |
 
 
 ### Framework versions
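In the updated results table, validation loss is lowest at step 40 and rises afterwards while training loss keeps falling. The reported evaluation loss (0.9820) is the value at the final step. A minimal sketch for picking out the lowest-validation-loss row, using only the values from the new side of this diff. The names `rows` and `best_row` are illustrative and not part of the repository.

```python
# Rows copied from the updated README table:
# (training_loss, epoch, step, validation_loss)
rows = [
    (1.0672, 1.25, 20, 0.9785),
    (0.8414, 2.5, 40, 0.9222),
    (0.7461, 3.75, 60, 0.9236),
    (0.6702, 5.0, 80, 0.9316),
    (0.5844, 6.25, 100, 0.9820),
]


def best_row(table):
    """Return the row with the lowest validation loss (last column)."""
    return min(table, key=lambda r: r[3])


best = best_row(rows)
final = rows[-1]
print(f"lowest validation loss {best[3]} at step {best[2]} (epoch {best[1]})")
print(f"final validation loss {final[3]} at step {final[2]} (epoch {final[1]})")
```

Whether the step-40 checkpoint was saved or is preferable for downstream use is not stated in this commit.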
adapter_config.json CHANGED
@@ -19,8 +19,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "v_proj",
-    "q_proj"
+    "q_proj",
+    "v_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,
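The only change in this hunk is the order of the two entries in `target_modules`; the same two projection names appear on both sides. A small sketch that compares the old and new values as sets, assuming the order of this list carries no meaning for the adapter (the diff itself does not confirm this). The helper name `same_modules` is hypothetical.

```python
import json

# The two versions of the field as shown in the hunk above.
old_cfg = json.loads('{"target_modules": ["v_proj", "q_proj"]}')
new_cfg = json.loads('{"target_modules": ["q_proj", "v_proj"]}')


def same_modules(a, b):
    """Compare target_modules ignoring list order (an assumption, see lead-in)."""
    return set(a["target_modules"]) == set(b["target_modules"])


print(same_modules(old_cfg, new_cfg))
```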
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:63d1b3fef86af3898d8f023f28c5025731427004b29bbb8f9ba48baa9c6e171d
+oid sha256:59b074adc3e9c2915cb49fbf7eaff5c2f5eafd0977c66cccf898ede5c7eb387f
 size 109069176
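The binary files in this commit are stored as Git LFS pointers: each pointer records a spec version, a SHA-256 `oid` of the real file, and its size in bytes. Here the size is unchanged and only the `oid` differs, so the weights changed without changing the file length. A minimal sketch of parsing a pointer and checking downloaded bytes against it; the function names are illustrative, and for a file of this size a real check would hash in chunks rather than hold everything in memory.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data, pointer):
    """Check that raw bytes have the size and SHA-256 digest the pointer records."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported oid algorithm: {pointer['algo']}")
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


# Pointer text from the new side of the hunk above.
new_pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:59b074adc3e9c2915cb49fbf7eaff5c2f5eafd0977c66cccf898ede5c7eb387f\n"
    "size 109069176\n"
)
print(new_pointer["size"], new_pointer["digest"][:7])
```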
runs/Mar14_06-53-09_8422cc58f504/events.out.tfevents.1710399202.8422cc58f504.94.0 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:7932a483158c3202f61b8ab278b3d888a3a3de06172937f97d0f60255da56949
+size 8015
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4c3dc62a01c9d3d21ba3a250c8f738e32c7dd05d5ac555c4b04c4c4cb7c0d316
+oid sha256:57a62102659514689c773228ea8071a50f7af6375251a8225e6fc9c002f602ed
 size 4728