kurakurai/Luth-0.6B-Instruct


MaxLSB committed on Aug 6

Commit 942e2bd (verified) · 1 parent: 8498ba5

Update README.md

Files changed (1)

README.md +18 -0


@@ -11,6 +11,24 @@ base_model:
 pipeline_tag: text-generation
 ---
 ## Citation
 ```bibtex

 pipeline_tag: text-generation
 ---
+# Luth-0.6B
+**Luth-0.6B** is a French fine-tuned version of [Qwen3-0.6B](https://huggingface.co/Qwen/Qwen3-0.6B), trained on the [Luth-SFT](https://huggingface.co/datasets/kurakurai/luth-sft) dataset. Compared with the base model, it is substantially stronger in French instruction following, math, and general knowledge, while its English capabilities remain stable and even improve in some areas.
+## Model Details
+Luth-0.6B was fully fine-tuned on the Luth-SFT dataset using [Axolotl](https://github.com/axolotl-ai-cloud/axolotl). The fine-tuned checkpoint was then merged with the base Qwen3-0.6B model. This merge preserved the model's English capabilities while improving performance on nearly all benchmarks in both French and English.
+## Benchmark Results
+**French Evaluation:**
+![French Evaluation](media/french_evaluation.png)
+**English Evaluation:**
+![English Evaluation](media/english_evaluation.png)
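The Model Details text says the fine-tuned checkpoint was merged with the base model, but it does not name the merge method. As an illustration only, the sketch below assumes a plain linear interpolation of parameters, one common way to merge two checkpoints that share an architecture. The function `linear_merge`, the weight `alpha`, and the toy parameter names are hypothetical and are not taken from the Luth training code; plain Python lists stand in for real tensors so the example runs without any dependencies.

```python
from typing import Dict, List

# Toy stand-in for a real state dict: parameter name -> flat list of values.
StateDict = Dict[str, List[float]]


def linear_merge(base: StateDict, tuned: StateDict, alpha: float) -> StateDict:
    """Return (1 - alpha) * base + alpha * tuned, parameter by parameter.

    ASSUMPTION: linear interpolation is used purely for illustration; the
    README does not state which merge method or weight was actually used.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in [0, 1]")
    if base.keys() != tuned.keys():
        raise ValueError("checkpoints must have identical parameter names")

    merged: StateDict = {}
    for name, base_values in base.items():
        tuned_values = tuned[name]
        if len(base_values) != len(tuned_values):
            raise ValueError(f"shape mismatch for {name}")
        merged[name] = [
            (1.0 - alpha) * b + alpha * t
            for b, t in zip(base_values, tuned_values)
        ]
    return merged


# Hypothetical two-parameter "checkpoints".
base = {"layer.weight": [0.0, 1.0], "layer.bias": [2.0]}
tuned = {"layer.weight": [1.0, 3.0], "layer.bias": [4.0]}

merged = linear_merge(base, tuned, alpha=0.5)
print(merged)
```

With `alpha=0.0` the result equals the base checkpoint and with `alpha=1.0` it equals the fine-tuned one; intermediate values trade off how much of each checkpoint's behavior is kept, which is the intuition behind retaining English capabilities while gaining French ones.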
 ## Citation
 ```bibtex