Update README.md
README.md
CHANGED
@@ -10,7 +10,7 @@ pipeline_tag: text-generation
 
 # DECS NRP Detector
 
-This repository contains the NRP (
+This repository contains the NRP (Necessary Reasoning Prefix) detector model used in the DECS algorithm, as presented in the paper [Overthinking Reduction with Decoupled Rewards and Curriculum Data Scheduling](https://huggingface.co/papers/2509.25827).
 
 The NRP detector is designed to determine whether a given reasoning chunk contains the ground truth signal, enabling surgically precise token-level rewards to reduce "overthinking" in reasoning models.
 