Update README.md
Browse files
README.md
CHANGED
|
@@ -34,8 +34,8 @@ datasets:
|
|
| 34 |
# 🚀 Model
|
| 35 |
|
| 36 |
The model is trained in **two progressive distillation stages**.
|
| 37 |
-
First, short-CoT data is used to distill **Qwen3-0.6B** into **
|
| 38 |
-
Then, long-CoT data is used to obtain **
|
| 39 |
|
| 40 |
| Model | Description | AIME24 | Link |
|
| 41 |
| ------------------------- | ----------------------------------------------| ------ | ---- |
|
|
|
|
| 34 |
# 🚀 Model
|
| 35 |
|
| 36 |
The model is trained in **two progressive distillation stages**.
|
| 37 |
+
First, short-CoT data is used to distill **Qwen3-0.6B** into **ReasonLite-0.6B-Turbo**, improving **AIME24 accuracy from 11.0 → 57.1**.
|
| 38 |
+
Then, long-CoT data is used to obtain **ReasonLite-0.6B**, further boosting accuracy to **75.2**.
|
| 39 |
|
| 40 |
| Model | Description | AIME24 | Link |
|
| 41 |
| ------------------------- | ----------------------------------------------| ------ | ---- |
|