nvidia/DLER-R1-1.5B-Research

sliuau committed on Sep 3, 2025

Commit f109c48 · verified · 1 Parent(s): 431dd21

Update README.md

Files changed (1)

README.md +9 -0

@@ -1,4 +1,13 @@
 # Model Overview
+<div align="center">
+<span style="font-family: default; font-size: 1.5em;">DLER-R1-1.5B</span>
+<div>
+🚀 The leading efficient reasoning model for cutting-edge research and development 🌟
+</div>
+</div>
+![Comparison between DeepSeek-R1-1.5B and DLER-R1-1.5B](./assets/latency_8b.png)
 ### Description:
 DLER-Qwen-R1-1.5B is an ultra-efficient 1.5B open-weight reasoning model designed for challenging tasks such as mathematics, programming, and scientific problem-solving. It is trained with the DLER algorithm on agentica-org/DeepScaleR-Preview-Dataset. Compared to DeepSeek’s 1.5B model, DLER-Qwen-R1-1.5B achieves substantial efficiency gains, reducing the average response length by nearly 80% across diverse mathematical benchmarks with better accuracy.