minpeter/tiny-ko-sft

Text Generation

Generated from Trainer

text-generation-inference

minpeter committed on Jun 8, 2025

Commit 96a295b · verified · 1 Parent(s): 5998776

End of training

Files changed (1)

README.md +16 -16

@@ -157,7 +157,7 @@ weight_decay: 0.0
 This model is a fine-tuned version of [minpeter/pretrained-tiny-ko](https://huggingface.co/minpeter/pretrained-tiny-ko) on the lemon-mint/Korean-FineTome-100k, the lemon-mint/smol-koreantalk, the heegyu/open-korean-instructions-v20231020, the FreedomIntelligence/evol-instruct-korean, the FreedomIntelligence/alpaca-gpt4-korean, the FreedomIntelligence/sharegpt-korean, the coastral/korean-writing-style-instruct and the devngho/korean-instruction-mix datasets.
 It achieves the following results on the evaluation set:
-- Loss: 1.4286
+- Loss: 1.4059
 ## Model description
@@ -194,21 +194,21 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 2.9956        | 0.0010 | 1    | 3.0182          |
-| 1.7162        | 0.2019 | 200  | 1.7023          |
-| 1.6186        | 0.4037 | 400  | 1.6351          |
-| 1.5474        | 0.6056 | 600  | 1.5951          |
-| 1.5822        | 0.8075 | 800  | 1.5540          |
-| 1.3144        | 1.0091 | 1000 | 1.5333          |
-| 1.403         | 1.2110 | 1200 | 1.5128          |
-| 1.3558        | 1.4128 | 1400 | 1.4832          |
-| 1.3165        | 1.6147 | 1600 | 1.4541          |
-| 1.2704        | 1.8166 | 1800 | 1.4305          |
-| 1.1913        | 2.0182 | 2000 | 1.4424          |
-| 1.1488        | 2.2200 | 2200 | 1.4346          |
-| 1.0417        | 2.4219 | 2400 | 1.4311          |
-| 1.1104        | 2.6238 | 2600 | 1.4288          |
-| 1.1446        | 2.8256 | 2800 | 1.4286          |
+| 3.9539        | 0.0010 | 1    | 3.9757          |
+| 1.6999        | 0.2019 | 200  | 1.6884          |
+| 1.6123        | 0.4037 | 400  | 1.6288          |
+| 1.5387        | 0.6056 | 600  | 1.5876          |
+| 1.5681        | 0.8075 | 800  | 1.5429          |
+| 1.3066        | 1.0091 | 1000 | 1.5208          |
+| 1.395         | 1.2110 | 1200 | 1.5007          |
+| 1.3474        | 1.4128 | 1400 | 1.4699          |
+| 1.3025        | 1.6147 | 1600 | 1.4383          |
+| 1.2566        | 1.8166 | 1800 | 1.4117          |
+| 1.1672        | 2.0182 | 2000 | 1.4227          |
+| 1.1267        | 2.2200 | 2200 | 1.4141          |
+| 1.0195        | 2.4219 | 2400 | 1.4098          |
+| 1.084         | 2.6238 | 2600 | 1.4063          |
+| 1.1254        | 2.8256 | 2800 | 1.4059          |
 ### Framework versions
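
To make the effect of this commit easier to read, the following is a minimal Python sketch that compares the validation-loss column of the two tables above. The numbers are copied from the diff; the variable names (`old_val_loss`, `new_val_loss`, `perplexity`) are illustrative and not part of the repository. The perplexity conversion assumes the reported loss is a mean per-token cross-entropy in nats, which the model card does not state explicitly.

```python
import math

# Validation loss by step, copied from the removed (old) and added (new) table rows.
old_val_loss = {
    1: 3.0182, 200: 1.7023, 400: 1.6351, 600: 1.5951, 800: 1.5540,
    1000: 1.5333, 1200: 1.5128, 1400: 1.4832, 1600: 1.4541, 1800: 1.4305,
    2000: 1.4424, 2200: 1.4346, 2400: 1.4311, 2600: 1.4288, 2800: 1.4286,
}
new_val_loss = {
    1: 3.9757, 200: 1.6884, 400: 1.6288, 600: 1.5876, 800: 1.5429,
    1000: 1.5208, 1200: 1.5007, 1400: 1.4699, 1600: 1.4383, 1800: 1.4117,
    2000: 1.4227, 2200: 1.4141, 2400: 1.4098, 2600: 1.4063, 2800: 1.4059,
}


def perplexity(loss: float) -> float:
    """Convert a loss to perplexity, ASSUMING mean per-token cross-entropy in nats."""
    return math.exp(loss)


final_step = max(new_val_loss)

# Change in final evaluation loss between the two runs (negative means improvement).
final_delta = new_val_loss[final_step] - old_val_loss[final_step]

# Steps at which the new run is worse than the old run.
worse_steps = [s for s in sorted(old_val_loss) if new_val_loss[s] > old_val_loss[s]]

# Step with the lowest validation loss in each run.
best_old_step = min(old_val_loss, key=old_val_loss.get)
best_new_step = min(new_val_loss, key=new_val_loss.get)

print(f"final loss delta at step {final_step}: {final_delta:+.4f}")
print(f"perplexity (old -> new): {perplexity(old_val_loss[final_step]):.3f} "
      f"-> {perplexity(new_val_loss[final_step]):.3f}")
print(f"steps where the new run is worse: {worse_steps}")
print(f"best step (old, new): {best_old_step}, {best_new_step}")
```

On the figures in this diff, the new run starts from a higher loss at step 1 but is lower at every later logged step. In both runs the validation loss rises between steps 1800 and 2000 before recovering.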