wandermay committed · Commit 96ce973 · verified · 1 Parent(s): d13f3d3

Update README.md

Files changed (1)
  1. README.md +5 -5
README.md CHANGED
@@ -6,12 +6,12 @@ license: mit
 ### Model description
 Most existing methods focus on distilling DeepSeek-R1 to improve reasoning ability. However, as far as we know, no distilled model has surpassed DeepSeek-R1 or QwQ-32B. We introduce NTele-R1-32B-DS, a state-of-the-art mathematical reasoning model that outperforms QwQ-32B across common reasoning benchmarks, including AIME2024/2025, MATH500 and GPQA-Diamond.
 Notably, NTele-R1-32B-DS is the first to achieve **more than 80/70 on the challenging AIME2024/2025**.
-| Model | Trained From | Release Date | AIME2024(ours/reported) | AIME2025(o/r) | MATH500(o/r) | GPQA-Diamond(o/r) |
+| Model | Trained From | Release Date | AIME2024 | AIME2025 | MATH500 | GPQA-Diamond |
 |-------|-------|-------|-------|-------|-------|-------|
-| QwQ-32B | - | 25.3.6 | 76.25 / 79.5 | 67.30 / - | 94.6 / - | 63.6 / - |
-| DeepSeek-32B-Distill | Qwen2.5-32B-Instruct | 25.1.20 | 64.17 / 72.6 | 55.21 / - | 89.8 / 94.3 | 62.1 / 62.1 |
-| Light-R1-32B-DS | DeepSeek-R1-Distill-Qwen-32B | 25.3.12 | 74.79 / 78.1 | 68.54 / 65.9 | 92 / - | **69.19 / 68.0** |
-| AReal-boba-SFT-32B | DeepSeek-R1-Distill-Qwen-32B | 25.3.30 | 70.63 / 78.8 | 63.54 / 62.1 | 88.8 / - | 64.65 / 60.1 |
+| QwQ-32B | - | 25.3.6 | 76.25 | 67.30 | 94.6 | 63.6 |
+| DeepSeek-32B-Distill | Qwen2.5-32B-Instruct | 25.1.20 | 64.17 | 55.21 | 89.8 | 62.1 |
+| Light-R1-32B-DS | DeepSeek-R1-Distill-Qwen-32B | 25.3.12 | 74.79 | 68.54 | 92 | **69.19** |
+| AReal-boba-SFT-32B | DeepSeek-R1-Distill-Qwen-32B | 25.3.30 | 70.63 | 63.54 | 88.8 | 64.65 |
 | NTele-R1-32B-DS | DeepSeek-R1-Distill-Qwen-32B | 25.4.17 | **80.42** | **73.54** | **95.4** | 66.16 |
 
 