Add a link to SmolSwallow-1.5B-Instruct
Browse files
README.md
CHANGED
|
@@ -14,6 +14,8 @@ base_model:
|
|
| 14 |
**SmolSwallow-1.5B** is a Japanese compact language model created through TAID (Temporally Adaptive Interpolated Distillation), our new knowledge distillation method.
|
| 15 |
We used [Qwen2.5-32B-Instruct](https://huggingface.co/Qwen/Qwen2.5-32B-Instruct) as the teacher model and [Qwen2.5-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct) as the student model.
|
| 16 |
The model has been further pre-trained on Japanese text data to enhance its Japanese language capabilities.
|
|
|
|
|
|
|
| 17 |
|
| 18 |
## Model Details
|
| 19 |
|
|
|
|
| 14 |
**SmolSwallow-1.5B** is a Japanese compact language model created through TAID (Temporally Adaptive Interpolated Distillation), our new knowledge distillation method.
|
| 15 |
We used [Qwen2.5-32B-Instruct](https://huggingface.co/Qwen/Qwen2.5-32B-Instruct) as the teacher model and [Qwen2.5-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct) as the student model.
|
| 16 |
The model has been further pre-trained on Japanese text data to enhance its Japanese language capabilities.
|
| 17 |
+
|
| 18 |
+
If you are looking for an instruction-following model, check [SmolSwallow-1.5B-Instruct](https://huggingface.co/SakanaAI/SmolSwallow-1.5B-Instruct).
|
| 19 |
|
| 20 |
## Model Details
|
| 21 |
|