End of training
README.md CHANGED
@@ -5,7 +5,15 @@ tags:
 - axolotl
 - generated_from_trainer
 datasets:
+- lemon-mint/Korean-FineTome-100k
 - lemon-mint/smol-koreantalk
+- heegyu/open-korean-instructions-v20231020
+- trillionlabs/multisystem-curated
+- allenai/tulu-3-sft-personas-instruction-following
+- coastral/korean-writing-style-instruct
+- devngho/korean-instruction-mix
+- youjunhyeok/Magpie-Pro-300K-Filtered-ko
+- youjunhyeok/smoltalk-ko-translate
 model-index:
 - name: tiny-ko-124m-sft
   results: []
@@ -33,6 +41,14 @@ strict: false

 chat_template: chatml
 datasets:
+  - path: lemon-mint/Korean-FineTome-100k
+    type: chat_template
+    split: train
+    field_messages: messages
+    message_property_mappings:
+      role: role
+      content: content
+
   - path: lemon-mint/smol-koreantalk
     type: chat_template
     split: train
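With `chat_template: chatml`, each training conversation is rendered in ChatML markup before tokenization. A minimal sketch of the rendered form (the Korean text is invented for illustration; only the markup follows from the config):

```python
# ChatML wraps every message in <|im_start|>{role} ... <|im_end|> markers.
# The message content below is hypothetical, not taken from the datasets.
rendered = (
    "<|im_start|>user\n"
    "안녕하세요, 자기소개 부탁해요.<|im_end|>\n"
    "<|im_start|>assistant\n"
    "안녕하세요! 저는 한국어로 대화하는 어시스턴트입니다.<|im_end|>\n"
)
```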
@@ -41,6 +57,64 @@ datasets:
       role: role
       content: content

+  - path: heegyu/open-korean-instructions-v20231020
+    type: chat_template
+    split: train
+    field_messages: conversations
+    message_property_mappings:
+      role: from
+      content: value
+    roles:
+      user: ["human", "user"]
+      assistant: ["gpt", "assistant", "bot"]
+      system: ["system", "input"]
+  - path: trillionlabs/multisystem-curated
+    type: chat_template
+    split: train
+    field_messages: messages
+    message_property_mappings:
+      role: role
+      content: content
+  - path: allenai/tulu-3-sft-personas-instruction-following
+    type: chat_template
+    split: train
+    field_messages: messages
+    message_property_mappings:
+      role: role
+      content: content
+  - path: coastral/korean-writing-style-instruct
+    type: chat_template
+    split: train
+    field_messages: conversations
+    message_property_mappings:
+      role: from
+      content: value
+
+  - path: devngho/korean-instruction-mix
+    type: chat_template
+    split: train
+    field_messages: messages
+    message_property_mappings:
+      role: from
+      content: value
+
+  - path: youjunhyeok/Magpie-Pro-300K-Filtered-ko
+    type: chat_template
+    split: train
+    field_messages: conversations
+    message_property_mappings:
+      role: from
+      content: value
+
+  - path: youjunhyeok/smoltalk-ko-translate
+    type: chat_template
+    split: train
+    name: merge_filtered
+    field_messages: conversations
+    message_property_mappings:
+      role: role
+      content: content
+
 dataset_prepared_path: last_run_prepared
 val_set_size: 0.001
 save_safetensors: true
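Several of the sources added above are ShareGPT-style datasets: messages sit under a `conversations` field with `from`/`value` keys, and speakers use aliases such as `human` or `gpt`. The `field_messages`, `message_property_mappings`, and `roles` keys tell axolotl how to normalize those records before the chat template is applied. A rough Python sketch of the equivalent transformation (`normalize` is a hypothetical helper, not axolotl's internal code):

```python
# Fold role aliases into canonical user/assistant/system roles,
# mirroring the roles: mapping in the config above.
ROLE_ALIASES = {
    "human": "user", "user": "user",
    "gpt": "assistant", "assistant": "assistant", "bot": "assistant",
    "system": "system", "input": "system",
}

def normalize(conversations: list[dict]) -> list[dict]:
    """Rename from/value to role/content, per message_property_mappings."""
    return [
        {"role": ROLE_ALIASES[msg["from"]], "content": msg["value"]}
        for msg in conversations
    ]

# [{'from': 'human', 'value': '...'}] -> [{'role': 'user', 'content': '...'}]
```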
@@ -94,9 +168,9 @@ fsdp_config:

 # tiny-ko-124m-sft

-This model is a fine-tuned version of [minpeter/tiny-ko-124m-base](https://huggingface.co/minpeter/tiny-ko-124m-base) on the lemon-mint/smol-koreantalk dataset.
+This model is a fine-tuned version of [minpeter/tiny-ko-124m-base](https://huggingface.co/minpeter/tiny-ko-124m-base) on the lemon-mint/Korean-FineTome-100k, lemon-mint/smol-koreantalk, heegyu/open-korean-instructions-v20231020, trillionlabs/multisystem-curated, allenai/tulu-3-sft-personas-instruction-following, coastral/korean-writing-style-instruct, devngho/korean-instruction-mix, youjunhyeok/Magpie-Pro-300K-Filtered-ko, and youjunhyeok/smoltalk-ko-translate datasets.
 It achieves the following results on the evaluation set:
-- Loss: 1.
+- Loss: 1.7098

 ## Model description

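A minimal inference sketch with transformers, assuming the checkpoint is published as `minpeter/tiny-ko-124m-sft` (a guess from the model name and the base model's owner) and that the tokenizer ships the ChatML template used in training:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical repo id, inferred from the model name and base-model owner.
repo_id = "minpeter/tiny-ko-124m-sft"
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id)

messages = [{"role": "user", "content": "안녕하세요!"}]
# apply_chat_template renders the ChatML markup and appends the assistant header.
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
)
outputs = model.generate(inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```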
@@ -127,17 +201,38 @@ The following hyperparameters were used during training:
 - optimizer: OptimizerNames.ADAMW_BNB with betas=(0.9, 0.999), epsilon=1e-08, and no additional optimizer arguments
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 20
-- training_steps:
+- training_steps: 5042

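These scheduler settings combine as in transformers' `get_cosine_schedule_with_warmup`: the learning rate ramps linearly over the first 20 steps, then follows a half-cosine down to zero at step 5042. A sketch of the resulting multiplier (the base learning rate is not shown in this excerpt, so only the fraction of it is computed):

```python
import math

def lr_lambda(step: int, warmup_steps: int = 20, total_steps: int = 5042) -> float:
    """Fraction of the base LR at a given step: linear warmup, then cosine decay."""
    if step < warmup_steps:
        return step / warmup_steps
    progress = (step - warmup_steps) / (total_steps - warmup_steps)
    return 0.5 * (1.0 + math.cos(math.pi * progress))

# lr_lambda(20) == 1.0 at the end of warmup; lr_lambda(5042) == 0.0 at the end.
```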
 ### Training results

 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| No log | 0 | 0 | 2.
-| 2.
-
-
-| 1.
+| No log        | 0      | 0    | 2.7016          |
+| 2.1419        | 0.0397 | 200  | 2.1320          |
+| 2.0675        | 0.0793 | 400  | 2.0446          |
+| 2.0252        | 0.1190 | 600  | 1.9864          |
+| 1.9304        | 0.1587 | 800  | 1.9468          |
+| 1.9536        | 0.1983 | 1000 | 1.9145          |
+| 1.8692        | 0.2380 | 1200 | 1.8879          |
+| 1.8556        | 0.2777 | 1400 | 1.8645          |
+| 1.8421        | 0.3174 | 1600 | 1.8433          |
+| 1.9118        | 0.3570 | 1800 | 1.8256          |
+| 1.7791        | 0.3967 | 2000 | 1.8090          |
+| 1.8162        | 0.4364 | 2200 | 1.7934          |
+| 1.796         | 0.4760 | 2400 | 1.7795          |
+| 1.749         | 0.5157 | 2600 | 1.7661          |
+| 1.7536        | 0.5554 | 2800 | 1.7540          |
+| 1.7672        | 0.5950 | 3000 | 1.7432          |
+| 1.7523        | 0.6347 | 3200 | 1.7336          |
+| 1.7074        | 0.6744 | 3400 | 1.7259          |
+| 1.7218        | 0.7141 | 3600 | 1.7202          |
+| 1.6928        | 0.7537 | 3800 | 1.7158          |
+| 1.7184        | 0.7934 | 4000 | 1.7127          |
+| 1.761         | 0.8331 | 4200 | 1.7109          |
+| 1.7481        | 0.8727 | 4400 | 1.7101          |
+| 1.7245        | 0.9124 | 4600 | 1.7098          |
+| 1.7076        | 0.9521 | 4800 | 1.7097          |
+| 1.7403        | 0.9917 | 5000 | 1.7098          |


 ### Framework versions