---
base_model:
- Qwen/Qwen2.5-1.5B-Instruct
language:
- en
license: mit
library_name: transformers
pipeline_tag: text-generation
---

# DECS NRP Detector 

This repository contains the NRP (Necessary Reasoning Prefix) detector model used in the DECS algorithm, as presented in the paper [Overthinking Reduction with Decoupled Rewards and Curriculum Data Scheduling](https://huggingface.co/papers/2509.25827). 

The NRP detector is designed to determine whether a given reasoning chunk contains the ground truth signal, enabling surgically precise token-level rewards to reduce "overthinking" in reasoning models.

- **Project Page:** [https://pixas.github.io/decs-iclr26-site/](https://pixas.github.io/decs-iclr26-site/)
- **Repository:** [https://github.com/pixas/DECS](https://github.com/pixas/DECS)
- **Paper:** [arXiv:2509.25827](https://huggingface.co/papers/2509.25827)

## Usage

According to the official repository, you can deploy the NRP detector using `vLLM`:

```bash
vllm serve --model pixas/DECS_NRP_DETECTOR --port 10041 
```

## Citation

If you use this model, please cite the following work:

```bibtex
@inproceedings{jiang2026decs,
  title     = {Overthinking Reduction with Decoupled Rewards and Curriculum Data Scheduling},
  author    = {Jiang, Shuyang and Tao, Xiaofeng and Zhang, Kui and Xiao, Yanghua},
  booktitle = {International Conference on Learning Representations (ICLR)},
  year      = {2026},
  note      = {Oral},
  url       = {https://arxiv.org/abs/2509.25827}
}
```