minpeter committed
Commit 96a295b · verified · 1 Parent(s): 5998776

End of training

Files changed (1)
  1. README.md +16 -16
README.md CHANGED
@@ -157,7 +157,7 @@ weight_decay: 0.0
 
  This model is a fine-tuned version of [minpeter/pretrained-tiny-ko](https://huggingface.co/minpeter/pretrained-tiny-ko) on the lemon-mint/Korean-FineTome-100k, the lemon-mint/smol-koreantalk, the heegyu/open-korean-instructions-v20231020, the FreedomIntelligence/evol-instruct-korean, the FreedomIntelligence/alpaca-gpt4-korean, the FreedomIntelligence/sharegpt-korean, the coastral/korean-writing-style-instruct and the devngho/korean-instruction-mix datasets.
  It achieves the following results on the evaluation set:
- - Loss: 1.4286
+ - Loss: 1.4059
 
  ## Model description
 
@@ -194,21 +194,21 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:------:|:----:|:---------------:|
- | 2.9956 | 0.0010 | 1 | 3.0182 |
- | 1.7162 | 0.2019 | 200 | 1.7023 |
- | 1.6186 | 0.4037 | 400 | 1.6351 |
- | 1.5474 | 0.6056 | 600 | 1.5951 |
- | 1.5822 | 0.8075 | 800 | 1.5540 |
- | 1.3144 | 1.0091 | 1000 | 1.5333 |
- | 1.403 | 1.2110 | 1200 | 1.5128 |
- | 1.3558 | 1.4128 | 1400 | 1.4832 |
- | 1.3165 | 1.6147 | 1600 | 1.4541 |
- | 1.2704 | 1.8166 | 1800 | 1.4305 |
- | 1.1913 | 2.0182 | 2000 | 1.4424 |
- | 1.1488 | 2.2200 | 2200 | 1.4346 |
- | 1.0417 | 2.4219 | 2400 | 1.4311 |
- | 1.1104 | 2.6238 | 2600 | 1.4288 |
- | 1.1446 | 2.8256 | 2800 | 1.4286 |
+ | 3.9539 | 0.0010 | 1 | 3.9757 |
+ | 1.6999 | 0.2019 | 200 | 1.6884 |
+ | 1.6123 | 0.4037 | 400 | 1.6288 |
+ | 1.5387 | 0.6056 | 600 | 1.5876 |
+ | 1.5681 | 0.8075 | 800 | 1.5429 |
+ | 1.3066 | 1.0091 | 1000 | 1.5208 |
+ | 1.395 | 1.2110 | 1200 | 1.5007 |
+ | 1.3474 | 1.4128 | 1400 | 1.4699 |
+ | 1.3025 | 1.6147 | 1600 | 1.4383 |
+ | 1.2566 | 1.8166 | 1800 | 1.4117 |
+ | 1.1672 | 2.0182 | 2000 | 1.4227 |
+ | 1.1267 | 2.2200 | 2200 | 1.4141 |
+ | 1.0195 | 2.4219 | 2400 | 1.4098 |
+ | 1.084 | 2.6238 | 2600 | 1.4063 |
+ | 1.1254 | 2.8256 | 2800 | 1.4059 |
 
 
  ### Framework versions
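
To read the change in this commit at a glance, the two validation-loss columns can be compared step by step. The following is a minimal sketch, not part of the commit: the values are copied by hand from the removed and added table rows, and the names `steps`, `old_val`, `new_val`, `best_step`, and `deltas` are hypothetical helpers introduced only for illustration.

```python
# Validation loss per logged step, copied from the diff above.
# old_val comes from the removed (-) rows, new_val from the added (+) rows.
steps = [1] + list(range(200, 3000, 200))  # 1, 200, 400, ..., 2800

old_val = [3.0182, 1.7023, 1.6351, 1.5951, 1.5540, 1.5333, 1.5128, 1.4832,
           1.4541, 1.4305, 1.4424, 1.4346, 1.4311, 1.4288, 1.4286]
new_val = [3.9757, 1.6884, 1.6288, 1.5876, 1.5429, 1.5208, 1.5007, 1.4699,
           1.4383, 1.4117, 1.4227, 1.4141, 1.4098, 1.4063, 1.4059]


def best_step(steps, losses):
    """Return (step, loss) for the lowest validation loss in the series."""
    i = min(range(len(losses)), key=losses.__getitem__)
    return steps[i], losses[i]


# Per-step change in validation loss (new run minus old run);
# negative values mean the new run is lower at that step.
deltas = {s: round(n - o, 4) for s, o, n in zip(steps, old_val, new_val)}

if __name__ == "__main__":
    print("best (old):", best_step(steps, old_val))
    print("best (new):", best_step(steps, new_val))
    for s in steps:
        print(f"step {s:>4}: {deltas[s]:+.4f}")
```

In both runs the lowest logged validation loss falls on the last row (step 2800), which matches the `Loss:` value reported at the top of the model card in each version.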