DECS_NRP_DETECTOR / README.md
pixas's picture
Update README.md
2474521 verified
---
base_model:
- Qwen/Qwen2.5-1.5B-Instruct
language:
- en
license: mit
library_name: transformers
pipeline_tag: text-generation
---
# DECS NRP Detector
This repository contains the NRP (Necessary Reasoning Prefix) detector model used in the DECS algorithm, as presented in the paper [Overthinking Reduction with Decoupled Rewards and Curriculum Data Scheduling](https://huggingface.co/papers/2509.25827).
The NRP detector is designed to determine whether a given reasoning chunk contains the ground truth signal, enabling surgically precise token-level rewards to reduce "overthinking" in reasoning models.
- **Project Page:** [https://pixas.github.io/decs-iclr26-site/](https://pixas.github.io/decs-iclr26-site/)
- **Repository:** [https://github.com/pixas/DECS](https://github.com/pixas/DECS)
- **Paper:** [arXiv:2509.25827](https://huggingface.co/papers/2509.25827)
## Usage
According to the official repository, you can deploy the NRP detector using `vLLM`:
```bash
vllm serve --model pixas/DECS_NRP_DETECTOR --port 10041
```
## Citation
If you use this model, please cite the following work:
```bibtex
@inproceedings{jiang2026decs,
title = {Overthinking Reduction with Decoupled Rewards and Curriculum Data Scheduling},
author = {Jiang, Shuyang and Tao, Xiaofeng and Zhang, Kui and Xiao, Yanghua},
booktitle = {International Conference on Learning Representations (ICLR)},
year = {2026},
note = {Oral},
url = {https://arxiv.org/abs/2509.25827}
}
```