Update README.md
Browse files
README.md
CHANGED
|
@@ -19,12 +19,16 @@ pipeline_tag: text-generation
|
|
| 19 |
|
| 20 |
**Luth-0.6B-Instruct** is a French fine-tuned version of [Qwen3-0.6B](https://huggingface.co/Qwen/Qwen3-0.6B), trained on the [Luth-SFT](https://huggingface.co/datasets/kurakurai/luth-sft) dataset. The model has drastically improved its French capabilities in instruction following, math, and general knowledge. Additionally, its English capabilities have remained stable and have even increased in some areas.
|
| 21 |
|
|
|
|
|
|
|
| 22 |
## Model Details
|
| 23 |
|
| 24 |
Luth was trained using full fine-tuning on the Luth-SFT dataset with [Axolotl](https://github.com/axolotl-ai-cloud/axolotl). The resulting model was then merged with the base Qwen3-0.6B model. This process successfully retained the model's English capabilities while improving its performance on nearly all selected benchmarks in both French and English.
|
| 25 |
|
| 26 |
## Benchmark Results
|
| 27 |
|
|
|
|
|
|
|
| 28 |
**French Evaluation:**
|
| 29 |
|
| 30 |

|
|
|
|
| 19 |
|
| 20 |
**Luth-0.6B-Instruct** is a French fine-tuned version of [Qwen3-0.6B](https://huggingface.co/Qwen/Qwen3-0.6B), trained on the [Luth-SFT](https://huggingface.co/datasets/kurakurai/luth-sft) dataset. The model has drastically improved its French capabilities in instruction following, math, and general knowledge. Additionally, its English capabilities have remained stable and have even increased in some areas.
|
| 21 |
|
| 22 |
+
Our Evaluation, training and data scripts are available on [GitHub](https://github.com/kurakurai/Luth).
|
| 23 |
+
|
| 24 |
## Model Details
|
| 25 |
|
| 26 |
Luth was trained using full fine-tuning on the Luth-SFT dataset with [Axolotl](https://github.com/axolotl-ai-cloud/axolotl). The resulting model was then merged with the base Qwen3-0.6B model. This process successfully retained the model's English capabilities while improving its performance on nearly all selected benchmarks in both French and English.
|
| 27 |
|
| 28 |
## Benchmark Results
|
| 29 |
|
| 30 |
+
We used LightEval for evaluation, with custom tasks for the French benchmarks. The models were evaluated with a `temperature=0`.
|
| 31 |
+
|
| 32 |
**French Evaluation:**
|
| 33 |
|
| 34 |

|