pixas
/

DECS_1.5B

@@ -1,26 +1,31 @@
 ---
 language:
 - zh
 - en
 pipeline_tag: text-generation
 tags:
 - deepscaler
 - reasoning
 - grpo
 - qwen2
-base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
-license: other
 ---
 # DECS_1.5B
-This is the official model for ICLR 2026 Oral "Overthinking Reduction with Decoupled Rewards and Curriculum Data Scheduling".
-DECS_1.5B is a reasoning-focused causal language model built from `deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B` and further trained with DECS algorithm, focused on 50% fewer tokens when answering a reasoning-required problem.
 ## Model Summary
-- Base model: `deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B`
-- Upload date: `2026-02-24`
-- Recommended use: long-form reasoning and mathematical/problem-solving style generation
 ## Quick Start (Transformers)
@@ -86,4 +91,4 @@ booktitle={The Fourteenth International Conference on Learning Representations},
 year={2026},
 url={https://openreview.net/forum?id=kdeiRledV6}
 }
-```

 ---
+base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
 language:
 - zh
 - en
+license: other
 pipeline_tag: text-generation
+library_name: transformers
 tags:
 - deepscaler
 - reasoning
 - grpo
 - qwen2
 ---
 # DECS_1.5B
+This is the official model for the ICLR 2026 Oral paper: "**Overthinking Reduction with Decoupled Rewards and Curriculum Data Scheduling**".
+[**Paper**](https://huggingface.co/papers/2509.25827) | [**Code**](https://github.com/pixas/DECS) | [**Project Page**](https://pixas.github.io/decs-iclr26-site/)
+DECS_1.5B is a reasoning-focused causal language model built from `deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B` and further trained with the DECS algorithm, focused on 50% fewer tokens when answering a reasoning-required problem.
 ## Model Summary
+- **Base model:** `deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B`
+- **Upload date:** `2026-02-24`
+- **Recommended use:** long-form reasoning and mathematical/problem-solving style generation
 ## Quick Start (Transformers)
 year={2026},
 url={https://openreview.net/forum?id=kdeiRledV6}
 }
+```