minpeter commited on
Commit
9f29ac7
·
verified ·
1 Parent(s): 974c016

End of training

Browse files
Files changed (1) hide show
  1. README.md +16 -16
README.md CHANGED
@@ -157,7 +157,7 @@ weight_decay: 0.0
157
 
158
  This model is a fine-tuned version of [minpeter/pretrained-tiny-ko](https://huggingface.co/minpeter/pretrained-tiny-ko) on the lemon-mint/Korean-FineTome-100k, the lemon-mint/smol-koreantalk, the heegyu/open-korean-instructions-v20231020, the FreedomIntelligence/evol-instruct-korean, the FreedomIntelligence/alpaca-gpt4-korean, the FreedomIntelligence/sharegpt-korean, the coastral/korean-writing-style-instruct and the devngho/korean-instruction-mix datasets.
159
  It achieves the following results on the evaluation set:
160
- - Loss: 1.5699
161
 
162
  ## Model description
163
 
@@ -194,21 +194,21 @@ The following hyperparameters were used during training:
194
 
195
  | Training Loss | Epoch | Step | Validation Loss |
196
  |:-------------:|:------:|:----:|:---------------:|
197
- | 2.8061 | 0.0010 | 1 | 2.8887 |
198
- | 1.9625 | 0.2019 | 200 | 1.9494 |
199
- | 1.8455 | 0.4037 | 400 | 1.8601 |
200
- | 1.7395 | 0.6056 | 600 | 1.8045 |
201
- | 1.7769 | 0.8075 | 800 | 1.7490 |
202
- | 1.5135 | 1.0091 | 1000 | 1.7116 |
203
- | 1.5928 | 1.2110 | 1200 | 1.6860 |
204
- | 1.5322 | 1.4128 | 1400 | 1.6517 |
205
- | 1.4939 | 1.6147 | 1600 | 1.6218 |
206
- | 1.4406 | 1.8166 | 1800 | 1.5939 |
207
- | 1.3999 | 2.0182 | 2000 | 1.5841 |
208
- | 1.3449 | 2.2200 | 2200 | 1.5770 |
209
- | 1.2352 | 2.4219 | 2400 | 1.5723 |
210
- | 1.3043 | 2.6238 | 2600 | 1.5702 |
211
- | 1.3467 | 2.8256 | 2800 | 1.5699 |
212
 
213
 
214
  ### Framework versions
 
157
 
158
  This model is a fine-tuned version of [minpeter/pretrained-tiny-ko](https://huggingface.co/minpeter/pretrained-tiny-ko) on the lemon-mint/Korean-FineTome-100k, the lemon-mint/smol-koreantalk, the heegyu/open-korean-instructions-v20231020, the FreedomIntelligence/evol-instruct-korean, the FreedomIntelligence/alpaca-gpt4-korean, the FreedomIntelligence/sharegpt-korean, the coastral/korean-writing-style-instruct and the devngho/korean-instruction-mix datasets.
159
  It achieves the following results on the evaluation set:
160
+ - Loss: 1.4634
161
 
162
  ## Model description
163
 
 
194
 
195
  | Training Loss | Epoch | Step | Validation Loss |
196
  |:-------------:|:------:|:----:|:---------------:|
197
+ | 2.696 | 0.0010 | 1 | 2.7432 |
198
+ | 1.7677 | 0.2019 | 200 | 1.7528 |
199
+ | 1.6696 | 0.4037 | 400 | 1.6833 |
200
+ | 1.5866 | 0.6056 | 600 | 1.6401 |
201
+ | 1.6249 | 0.8075 | 800 | 1.5957 |
202
+ | 1.3578 | 1.0091 | 1000 | 1.5704 |
203
+ | 1.4469 | 1.2110 | 1200 | 1.5514 |
204
+ | 1.3969 | 1.4128 | 1400 | 1.5220 |
205
+ | 1.3549 | 1.6147 | 1600 | 1.4939 |
206
+ | 1.3107 | 1.8166 | 1800 | 1.4695 |
207
+ | 1.2462 | 2.0182 | 2000 | 1.4751 |
208
+ | 1.2001 | 2.2200 | 2200 | 1.4692 |
209
+ | 1.0911 | 2.4219 | 2400 | 1.4661 |
210
+ | 1.1547 | 2.6238 | 2600 | 1.4636 |
211
+ | 1.1943 | 2.8256 | 2800 | 1.4634 |
212
 
213
 
214
  ### Framework versions