Safetensors
English
qwen2
sliuau commited on
Commit
568e757
·
verified ·
1 Parent(s): 4d5995d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -49,7 +49,7 @@ The integration of foundation and fine-tuned models into AI systems requires add
49
  ### Evaluation Results:
50
  **Benchmark Score <br>
51
 
52
- | Model | Math 500 | Length | AIME | Length | AMC | Length | Minerva |Length | Olympiad |Length | Total Avg |
53
  |------------------|----------|------------|--------------------|------------------|--------------------|------------------|--------------------|------------------|--------------------|------------------|-----------------|
54
  | Deepseek-R1-7B | 93.60 | 3999 | 55.40 | 13241 | 82.90 | 7461 | 49.79 | 5199 | 58.21 | 8837 | 7747 |
55
  | **DLER-R1-7B** | **94.21 (+0.61%)** | **1634 (-60%)** | **55.62 (+0.22%)** | **3230 (-76%)** | **84.41 (+1.51%)** | **2512 (-0.67%)** | **53.88 (+4.09%)** | **2058 (-61%)** | **60.48 (+2.27%)** | **2592 (-71%)** | **2405 (-69%)** |
 
49
  ### Evaluation Results:
50
  **Benchmark Score <br>
51
 
52
+ | Model | MATH | Length | AIME | Length | AMC | Length | Minerva |Length | Olympiad |Length | Total Avg |
53
  |------------------|----------|------------|--------------------|------------------|--------------------|------------------|--------------------|------------------|--------------------|------------------|-----------------|
54
  | Deepseek-R1-7B | 93.60 | 3999 | 55.40 | 13241 | 82.90 | 7461 | 49.79 | 5199 | 58.21 | 8837 | 7747 |
55
  | **DLER-R1-7B** | **94.21 (+0.61%)** | **1634 (-60%)** | **55.62 (+0.22%)** | **3230 (-76%)** | **84.41 (+1.51%)** | **2512 (-0.67%)** | **53.88 (+4.09%)** | **2058 (-61%)** | **60.48 (+2.27%)** | **2592 (-71%)** | **2405 (-69%)** |