minpeter
/

tiny-ko-sft

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

minpeter commited on Jun 6, 2025

Commit

9f29ac7

·

verified ·

1 Parent(s): 974c016

End of training

Files changed (1) hide show

README.md +16 -16

README.md CHANGED Viewed

@@ -157,7 +157,7 @@ weight_decay: 0.0
 This model is a fine-tuned version of [minpeter/pretrained-tiny-ko](https://huggingface.co/minpeter/pretrained-tiny-ko) on the lemon-mint/Korean-FineTome-100k, the lemon-mint/smol-koreantalk, the heegyu/open-korean-instructions-v20231020, the FreedomIntelligence/evol-instruct-korean, the FreedomIntelligence/alpaca-gpt4-korean, the FreedomIntelligence/sharegpt-korean, the coastral/korean-writing-style-instruct and the devngho/korean-instruction-mix datasets.
 It achieves the following results on the evaluation set:
-- Loss: 1.5699
 ## Model description
@@ -194,21 +194,21 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 2.8061        | 0.0010 | 1    | 2.8887          |
-| 1.9625        | 0.2019 | 200  | 1.9494          |
-| 1.8455        | 0.4037 | 400  | 1.8601          |
-| 1.7395        | 0.6056 | 600  | 1.8045          |
-| 1.7769        | 0.8075 | 800  | 1.7490          |
-| 1.5135        | 1.0091 | 1000 | 1.7116          |
-| 1.5928        | 1.2110 | 1200 | 1.6860          |
-| 1.5322        | 1.4128 | 1400 | 1.6517          |
-| 1.4939        | 1.6147 | 1600 | 1.6218          |
-| 1.4406        | 1.8166 | 1800 | 1.5939          |
-| 1.3999        | 2.0182 | 2000 | 1.5841          |
-| 1.3449        | 2.2200 | 2200 | 1.5770          |
-| 1.2352        | 2.4219 | 2400 | 1.5723          |
-| 1.3043        | 2.6238 | 2600 | 1.5702          |
-| 1.3467        | 2.8256 | 2800 | 1.5699          |
 ### Framework versions

 This model is a fine-tuned version of [minpeter/pretrained-tiny-ko](https://huggingface.co/minpeter/pretrained-tiny-ko) on the lemon-mint/Korean-FineTome-100k, the lemon-mint/smol-koreantalk, the heegyu/open-korean-instructions-v20231020, the FreedomIntelligence/evol-instruct-korean, the FreedomIntelligence/alpaca-gpt4-korean, the FreedomIntelligence/sharegpt-korean, the coastral/korean-writing-style-instruct and the devngho/korean-instruction-mix datasets.
 It achieves the following results on the evaluation set:
+- Loss: 1.4634
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 2.696         | 0.0010 | 1    | 2.7432          |
+| 1.7677        | 0.2019 | 200  | 1.7528          |
+| 1.6696        | 0.4037 | 400  | 1.6833          |
+| 1.5866        | 0.6056 | 600  | 1.6401          |
+| 1.6249        | 0.8075 | 800  | 1.5957          |
+| 1.3578        | 1.0091 | 1000 | 1.5704          |
+| 1.4469        | 1.2110 | 1200 | 1.5514          |
+| 1.3969        | 1.4128 | 1400 | 1.5220          |
+| 1.3549        | 1.6147 | 1600 | 1.4939          |
+| 1.3107        | 1.8166 | 1800 | 1.4695          |
+| 1.2462        | 2.0182 | 2000 | 1.4751          |
+| 1.2001        | 2.2200 | 2200 | 1.4692          |
+| 1.0911        | 2.4219 | 2400 | 1.4661          |
+| 1.1547        | 2.6238 | 2600 | 1.4636          |
+| 1.1943        | 2.8256 | 2800 | 1.4634          |
 ### Framework versions