Update README.md
README.md
CHANGED
@@ -6,12 +6,12 @@ license: mit
### Model description

Most existing methods focus on distilling DeepSeek-R1 to improve reasoning ability. However, as far as we know, no distilled model has surpassed DeepSeek-R1 or QwQ-32B. We introduce NTele-R1-32B-DS, a state-of-the-art mathematical reasoning model that outperforms QwQ-32B across common reasoning benchmarks, including AIME2024/2025, MATH500, and GPQA-Diamond.

Notably, NTele-R1-32B-DS is the first to score **above 80 on AIME2024 and above 70 on AIME2025**.

| Model | Trained From | Release Date | AIME2024 | AIME2025 | MATH500 | GPQA-Diamond |
|-------|-------|-------|-------|-------|-------|-------|
| QwQ-32B | - | 25.3.6 | 76.25 | 67.30 | 94.6 | 63.6 |
| DeepSeek-32B-Distill | Qwen2.5-32B-Instruct | 25.1.20 | 64.17 | 55.21 | 89.8 | 62.1 |
| Light-R1-32B-DS | DeepSeek-R1-Distill-Qwen-32B | 25.3.12 | 74.79 | 68.54 | 92 | **69.19** |
| AReal-boba-SFT-32B | DeepSeek-R1-Distill-Qwen-32B | 25.3.30 | 70.63 | 63.54 | 88.8 | 64.65 |
| NTele-R1-32B-DS | DeepSeek-R1-Distill-Qwen-32B | 25.4.17 | **80.42** | **73.54** | **95.4** | 66.16 |
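
The comparison against QwQ-32B can be checked directly from the reported numbers. The following is a minimal sketch, not part of the original README: the scores are copied from the table, the model name is spelled as in the prose (`NTele-R1-32B-DS`), and the helper names `margins` and `best_per_benchmark` are hypothetical.

```python
# Scores copied from the README table, in the order listed in BENCHMARKS.
BENCHMARKS = ("AIME2024", "AIME2025", "MATH500", "GPQA-Diamond")
SCORES = {
    "QwQ-32B": (76.25, 67.30, 94.6, 63.6),
    "DeepSeek-32B-Distill": (64.17, 55.21, 89.8, 62.1),
    "Light-R1-32B-DS": (74.79, 68.54, 92.0, 69.19),
    "AReal-boba-SFT-32B": (70.63, 63.54, 88.8, 64.65),
    "NTele-R1-32B-DS": (80.42, 73.54, 95.4, 66.16),
}


def margins(model, baseline):
    """Return per-benchmark score differences (model minus baseline)."""
    return {
        name: round(a - b, 2)
        for name, a, b in zip(BENCHMARKS, SCORES[model], SCORES[baseline])
    }


def best_per_benchmark():
    """Return the top-scoring model name for each benchmark."""
    return {
        name: max(SCORES, key=lambda m: SCORES[m][i])
        for i, name in enumerate(BENCHMARKS)
    }


if __name__ == "__main__":
    for name, delta in margins("NTele-R1-32B-DS", "QwQ-32B").items():
        print(f"{name}: {delta:+.2f}")
    print(best_per_benchmark())
```

On these numbers the model leads QwQ-32B on all four benchmarks, while Light-R1-32B-DS keeps the best GPQA-Diamond score, which matches the bolding in the table.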