nvidia/DLER-R1-7B-Research

sliuau committed on Sep 3, 2025

Commit 568e757 · verified · Parent: 4d5995d

Update README.md

Files changed (1)

README.md +1 -1

@@ -49,7 +49,7 @@ The integration of foundation and fine-tuned models into AI systems requires add
 ### Evaluation Results:
 **Benchmark Score <br>
-| Model            | Math 500 | Length | AIME               | Length        | AMC                | Length        | Minerva            |Length         | Olympiad           |Length         | Total Avg    |
+| Model            | MATH | Length | AIME               | Length        | AMC                | Length        | Minerva            |Length         | Olympiad           |Length         | Total Avg    |
 |------------------|----------|------------|--------------------|------------------|--------------------|------------------|--------------------|------------------|--------------------|------------------|-----------------|
 | Deepseek-R1-7B   | 93.60    | 3999       | 55.40              | 13241            | 82.90              | 7461             | 49.79              | 5199             | 58.21              | 8837             | 7747            |
 | **DLER-R1-7B**   | **94.21 (+0.61%)** | **1634 (-60%)** | **55.62 (+0.22%)** | **3230 (-76%)** | **84.41 (+1.51%)** | **2512 (-0.67%)** | **53.88 (+4.09%)** | **2058 (-61%)** | **60.48 (+2.27%)** | **2592 (-71%)** | **2405 (-69%)** |