Anixyz commited on
Commit
e987145
·
verified ·
1 Parent(s): 4361849

End of training

Browse files
README.md CHANGED
@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
20
 
21
  This model is a fine-tuned version of [mistralai/Mistral-7B-v0.3](https://huggingface.co/mistralai/Mistral-7B-v0.3) on the generator dataset.
22
  It achieves the following results on the evaluation set:
23
- - Loss: 1.2082
24
 
25
  ## Model description
26
 
@@ -51,16 +51,27 @@ The following hyperparameters were used during training:
51
 
52
  | Training Loss | Epoch | Step | Validation Loss |
53
  |:-------------:|:------:|:----:|:---------------:|
54
- | 1.2943 | 0.1852 | 20 | 1.2382 |
55
- | 1.2578 | 0.3704 | 40 | 1.2178 |
56
- | 1.224 | 0.5556 | 60 | 1.2128 |
57
- | 1.1958 | 0.7407 | 80 | 1.2094 |
58
- | 1.1841 | 0.9259 | 100 | 1.2078 |
59
- | 1.1847 | 1.1111 | 120 | 1.2071 |
60
- | 1.1636 | 1.2963 | 140 | 1.2076 |
61
- | 1.17 | 1.4815 | 160 | 1.2084 |
62
- | 1.1188 | 1.6667 | 180 | 1.2083 |
63
- | 1.1492 | 1.8519 | 200 | 1.2082 |
 
 
 
 
 
 
 
 
 
 
 
64
 
65
 
66
  ### Framework versions
 
20
 
21
  This model is a fine-tuned version of [mistralai/Mistral-7B-v0.3](https://huggingface.co/mistralai/Mistral-7B-v0.3) on the generator dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 1.2216
24
 
25
  ## Model description
26
 
 
51
 
52
  | Training Loss | Epoch | Step | Validation Loss |
53
  |:-------------:|:------:|:----:|:---------------:|
54
+ | 1.3113 | 0.0926 | 50 | 1.2471 |
55
+ | 1.2364 | 0.1852 | 100 | 1.2382 |
56
+ | 1.2241 | 0.2778 | 150 | 1.2332 |
57
+ | 1.2608 | 0.3704 | 200 | 1.2313 |
58
+ | 1.2333 | 0.4630 | 250 | 1.2288 |
59
+ | 1.2352 | 0.5556 | 300 | 1.2272 |
60
+ | 1.2323 | 0.6481 | 350 | 1.2256 |
61
+ | 1.2482 | 0.7407 | 400 | 1.2238 |
62
+ | 1.2104 | 0.8333 | 450 | 1.2227 |
63
+ | 1.2348 | 0.9259 | 500 | 1.2208 |
64
+ | 1.2176 | 1.0185 | 550 | 1.2208 |
65
+ | 1.1914 | 1.1111 | 600 | 1.2219 |
66
+ | 1.1972 | 1.2037 | 650 | 1.2230 |
67
+ | 1.1815 | 1.2963 | 700 | 1.2226 |
68
+ | 1.1838 | 1.3889 | 750 | 1.2230 |
69
+ | 1.2029 | 1.4815 | 800 | 1.2225 |
70
+ | 1.1571 | 1.5741 | 850 | 1.2224 |
71
+ | 1.1575 | 1.6667 | 900 | 1.2221 |
72
+ | 1.18 | 1.7593 | 950 | 1.2218 |
73
+ | 1.1708 | 1.8519 | 1000 | 1.2217 |
74
+ | 1.1513 | 1.9444 | 1050 | 1.2216 |
75
 
76
 
77
  ### Framework versions
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4552f00b48de363e040b89c60ed568deb179cfdf0651d114ce740e585ab30fc7
3
  size 13648432
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cc4a55e4a5647ad0e1235bbef2787eb89d14a4619165e0ff102483d508c7dcf1
3
  size 13648432
runs/Oct07_14-49-26_ab3e70a80cec/events.out.tfevents.1759848573.ab3e70a80cec.2564.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e925bbcf1e68433ff8c68b4bec1af040517afb33d2c566935e4a7113de062b80
3
- size 15055
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:640c0a29a129a51b2cbe30236679935a643c59dfaa78b200bad4426fc4527261
3
+ size 15891