amd
/

ReasonLite-0.6B-Turbo

Model card Files Files and versions

llmll commited on Jan 20

Commit

12c1323

·

verified ·

1 Parent(s): db18d9e

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -34,8 +34,8 @@ datasets:
 # 🚀 Model
 The model is trained in **two progressive distillation stages**.
-First, short-CoT data is used to distill **Qwen3-0.6B** into **AMD-0.6B-Turbo**, improving **AIME24 accuracy from 11.0 → 57.1**.
-Then, long-CoT data is used to obtain **AMD-0.6B**, further boosting accuracy to **75.2**.
 | Model                     | Description                                   | AIME24 | Link |
 | ------------------------- | ----------------------------------------------| ------ | ---- |

 # 🚀 Model
 The model is trained in **two progressive distillation stages**.
+First, short-CoT data is used to distill **Qwen3-0.6B** into **ReasonLite-0.6B-Turbo**, improving **AIME24 accuracy from 11.0 → 57.1**.
+Then, long-CoT data is used to obtain **ReasonLite-0.6B**, further boosting accuracy to **75.2**.
 | Model                     | Description                                   | AIME24 | Link |
 | ------------------------- | ----------------------------------------------| ------ | ---- |