Update README.md
README.md
CHANGED
@@ -11,6 +11,24 @@ base_model:
 pipeline_tag: text-generation
 ---
 
+# Luth-0.6B
+
+**Luth-0.6B** is a French fine-tuned version of [Qwen3-0.6B](https://huggingface.co/Qwen/Qwen3-0.6B), trained on the [Luth-SFT](https://huggingface.co/datasets/kurakurai/luth-sft) dataset. Fine-tuning substantially improves the model's French capabilities in instruction following, math, and general knowledge, while its English capabilities remain stable and even improve in some areas.
+
+## Model Details
+
+Luth-0.6B was trained with full fine-tuning on the Luth-SFT dataset using [Axolotl](https://github.com/axolotl-ai-cloud/axolotl). The fine-tuned model was then merged with the base Qwen3-0.6B model. This process retained the model's English capabilities while improving performance on nearly all benchmarks in both French and English.
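
The README does not state which merge method or mixing ratio was used. As a rough illustration only, the sketch below assumes a plain linear interpolation between base and fine-tuned weights, which is one common merging strategy and not necessarily the authors' method. The function name `merge_linear`, the mixing factor `alpha`, and the toy parameter values are all hypothetical; plain Python lists stand in for real weight tensors.

```python
from typing import Dict, List

# Stand-in for a model state dict: parameter name -> flattened weights.
StateDict = Dict[str, List[float]]


def merge_linear(base: StateDict, tuned: StateDict, alpha: float = 0.5) -> StateDict:
    """Interpolate parameter-wise: (1 - alpha) * base + alpha * tuned.

    alpha is an assumed mixing factor; the README does not give one.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in [0, 1]")
    if base.keys() != tuned.keys():
        raise ValueError("checkpoints must have identical parameter names")

    merged: StateDict = {}
    for name, base_values in base.items():
        tuned_values = tuned[name]
        if len(base_values) != len(tuned_values):
            raise ValueError(f"shape mismatch for {name}")
        merged[name] = [
            (1.0 - alpha) * b + alpha * t
            for b, t in zip(base_values, tuned_values)
        ]
    return merged


# Toy values only; real checkpoints hold millions of parameters.
base = {"layer.weight": [0.0, 1.0, 2.0], "layer.bias": [0.5]}
tuned = {"layer.weight": [1.0, 1.0, 4.0], "layer.bias": [1.5]}
merged = merge_linear(base, tuned, alpha=0.5)
print(merged)
```

With `alpha=0.0` the result equals the base weights and with `alpha=1.0` it equals the fine-tuned weights, so intermediate values trade off how much of the base model's behavior is kept against how much of the fine-tuned behavior is adopted.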
+
+## Benchmark Results
+
+**French Evaluation:**
+
+![French Evaluation](media/french_evaluation.png)
+
+**English Evaluation:**
+
+![English Evaluation](media/english_evaluation.png)
+
 ## Citation
 
 ```bibtex