aisuko commited on
Commit
300799f
·
verified ·
1 Parent(s): 3eb63ca

End of training

Browse files
README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the OpenAssistant/oasst_top1_2023-08-25 dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 1.9883
20
 
21
  ## Model description
22
 
@@ -50,9 +50,9 @@ The following hyperparameters were used during training:
50
 
51
  | Training Loss | Epoch | Step | Validation Loss |
52
  |:-------------:|:-----:|:----:|:---------------:|
53
- | 2.4057 | 1.0 | 56 | 1.5748 |
54
- | 0.924 | 1.99 | 112 | 1.7516 |
55
- | 0.6519 | 2.99 | 168 | 1.9883 |
56
 
57
 
58
  ### Framework versions
 
16
 
17
  This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the OpenAssistant/oasst_top1_2023-08-25 dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 2.1361
20
 
21
  ## Model description
22
 
 
50
 
51
  | Training Loss | Epoch | Step | Validation Loss |
52
  |:-------------:|:-----:|:----:|:---------------:|
53
+ | 1.8994 | 1.0 | 56 | 1.7425 |
54
+ | 0.8603 | 1.99 | 112 | 1.9602 |
55
+ | 0.5465 | 2.99 | 168 | 2.1361 |
56
 
57
 
58
  ### Framework versions
adapter_config.json CHANGED
@@ -22,13 +22,13 @@
22
  "rank_pattern": {},
23
  "revision": null,
24
  "target_modules": [
25
- "gate_proj",
26
  "o_proj",
27
- "down_proj",
28
  "q_proj",
 
29
  "up_proj",
30
- "k_proj",
31
- "v_proj"
32
  ],
33
  "task_type": "CAUSAL_LM"
34
  }
 
22
  "rank_pattern": {},
23
  "revision": null,
24
  "target_modules": [
 
25
  "o_proj",
26
+ "v_proj",
27
  "q_proj",
28
+ "gate_proj",
29
  "up_proj",
30
+ "down_proj",
31
+ "k_proj"
32
  ],
33
  "task_type": "CAUSAL_LM"
34
  }
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ca28466ddc70fde4aaff2e648e9219a65249474dc6635e8f7aa08a847dd7aabc
3
  size 1719791960
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a1024314204b45adf00b740187771a2497b34a8408bb4e3ebf72bbca2ec7e5b6
3
  size 1719791960
runs/Mar04_23-10-59_e67be63dcafc/events.out.tfevents.1709593861.e67be63dcafc.27.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0970c2605df87cd85685579aa624c23fbb61f1d5a902625e22b656a819a73a97
3
+ size 31910
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:19f2cac4fd2f80424b260249424878d6b79ec306232cc1158e822e887e8f5f08
3
  size 4728
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:59c77171bacb3634b6bdf042704eb85ba7f19b505a5946fc0961b399778dbe18
3
  size 4728