Mohitcr1 commited on
Commit
99c9887
·
verified ·
1 Parent(s): efd8342

mistral-0.2-fp

Browse files
README.md CHANGED
@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
20
 
21
  This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the generator dataset.
22
  It achieves the following results on the evaluation set:
23
- - Loss: 0.8049
24
 
25
  ## Model description
26
 
@@ -52,11 +52,11 @@ The following hyperparameters were used during training:
52
 
53
  | Training Loss | Epoch | Step | Validation Loss |
54
  |:-------------:|:-----:|:----:|:---------------:|
55
- | 1.0509 | 0.87 | 20 | 0.8995 |
56
- | 0.8839 | 1.74 | 40 | 0.8167 |
57
- | 0.8105 | 2.61 | 60 | 0.7898 |
58
- | 0.7148 | 3.48 | 80 | 0.7915 |
59
- | 0.6221 | 4.35 | 100 | 0.8049 |
60
 
61
 
62
  ### Framework versions
 
20
 
21
  This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the generator dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 0.9395
24
 
25
  ## Model description
26
 
 
52
 
53
  | Training Loss | Epoch | Step | Validation Loss |
54
  |:-------------:|:-----:|:----:|:---------------:|
55
+ | 1.1508 | 0.67 | 20 | 1.0891 |
56
+ | 0.9774 | 1.33 | 40 | 1.0043 |
57
+ | 0.9105 | 2.0 | 60 | 0.9710 |
58
+ | 0.8641 | 2.67 | 80 | 0.9539 |
59
+ | 0.8492 | 3.33 | 100 | 0.9395 |
60
 
61
 
62
  ### Framework versions
adapter_config.json CHANGED
@@ -20,8 +20,8 @@
20
  "rank_pattern": {},
21
  "revision": null,
22
  "target_modules": [
23
- "q_proj",
24
- "v_proj"
25
  ],
26
  "task_type": "CAUSAL_LM",
27
  "use_dora": false,
 
20
  "rank_pattern": {},
21
  "revision": null,
22
  "target_modules": [
23
+ "v_proj",
24
+ "q_proj"
25
  ],
26
  "task_type": "CAUSAL_LM",
27
  "use_dora": false,
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:bff8112d95bc932ce88e61a9b6fb54a3b33ddc96ca50e1ead366158968c8cfbd
3
  size 109069176
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:98956b458582df16dab7f2c373aeb60f751b6d8ba2cc240329a1f83086e84bed
3
  size 109069176
runs/Apr10_13-33-10_ab5a3d800f7a/events.out.tfevents.1712756003.ab5a3d800f7a.89.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1c23f3506a5236da3bb3eb3f1aeb7e77cbbedbafb6a7fadd86a64caa23dd0f48
3
+ size 8813
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f59623b10aaf9dc82e1c5c843c17496f95721a4649edec88c7620ffaeadbe4b9
3
  size 4920
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d466f979464643ae76daa15df67a23c2a12ca3b700a1dd21b2ed43f17dc6cecb
3
  size 4920