Update README.md
Browse files
README.md
CHANGED
|
@@ -48,8 +48,8 @@ For each model, we used the official system prompt provided by the corresponding
|
|
| 48 |
| Benchmark | DeepSeek-R1 | Qwen3-14B | QwQ-32B | DeepSeek-R1-Distill-Qwen-14B | Confucius3-Math |
|
| 49 |
|-------------------|----------------------|------------|--------------|----------------|------------|
|
| 50 |
| CK12-MATH | 92.74 | 94.04 | 93.60 | 82.86 | **96.24** |
|
| 51 |
-
| GAOKAO-Bench(math) | 93.27 | 94.44 | 94.93 | 86.75 | **98.46** |
|
| 52 |
-
| MathBench(K12) | 89.99 | 96.51 | **96.57** | 88.40 | 95.10 |
|
| 53 |
| CMATH | 95.81 | 95.90 | 95.95 | 77.41 | **96.13** |
|
| 54 |
| MATH-500 | 97.30 | 96.80 | 98.00 | 93.90 | **98.80** |
|
| 55 |
| AIME 2024 | 79.80 | 79.30 | 79.50 | 69.70 | **81.15** |
|
|
|
|
| 48 |
| Benchmark | DeepSeek-R1 | Qwen3-14B | QwQ-32B | DeepSeek-R1-Distill-Qwen-14B | Confucius3-Math |
|
| 49 |
|-------------------|----------------------|------------|--------------|----------------|------------|
|
| 50 |
| CK12-MATH | 92.74 | 94.04 | 93.60 | 82.86 | **96.24** |
|
| 51 |
+
| GAOKAO-Bench (math) | 93.27 | 94.44 | 94.93 | 86.75 | **98.46** |
|
| 52 |
+
| MathBench (K12) | 89.99 | 96.51 | **96.57** | 88.40 | 95.10 |
|
| 53 |
| CMATH | 95.81 | 95.90 | 95.95 | 77.41 | **96.13** |
|
| 54 |
| MATH-500 | 97.30 | 96.80 | 98.00 | 93.90 | **98.80** |
|
| 55 |
| AIME 2024 | 79.80 | 79.30 | 79.50 | 69.70 | **81.15** |
|