Commit cef443c (verified) by Jiaxi0775 · 1 Parent(s): 63edbe6

Update README.md

Files changed (1)
  1. README.md +39 -3
README.md CHANGED
@@ -1,3 +1,39 @@
- ---
- license: apache-2.0
- ---
+ # STOP-1.5B: Early Path Pruning Module
+
+ This repository contains the STOP module trained for prefix-level path pruning on top of a 1.5B reasoning model.
+
+ ## Overview
+
+ STOP (Super TOken for Pruning) is a lightweight module that predicts whether a reasoning prefix is promising, enabling early pruning of unproductive paths.
+
+ It operates by:
+
+ - Appending a special `[STOP]` token
+ - Reading internal KV-cache states
+ - Producing a scalar quality score
+
+ ## Architecture
+
+ - Base model: frozen reasoning model (1.5B)
+ - Adapter: LoRA-based critique module
+ - Head: lightweight classifier
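
To make the role of the head concrete, here is a minimal sketch of how a lightweight classifier could turn a feature vector into a scalar quality score. It is an illustration under stated assumptions, not the released implementation: the function `stop_head_score`, the logistic form, and the toy `state` and `weights` values are all hypothetical, and the real module reads its features from the model's internal states at the `[STOP]` position.

```python
import math


def stop_head_score(state, weights, bias=0.0):
    """Map a feature vector to a score in (0, 1) with a logistic head.

    `state` stands in for the features read at the [STOP] position;
    the names and the logistic form are assumptions for illustration.
    """
    if len(state) != len(weights):
        raise ValueError("state and weights must have the same length")
    logit = sum(s * w for s, w in zip(state, weights)) + bias
    # Numerically stable sigmoid: avoid exp() of a large positive number.
    if logit >= 0:
        return 1.0 / (1.0 + math.exp(-logit))
    e = math.exp(logit)
    return e / (1.0 + e)


# Toy values only; a real head would use learned weights.
state = [0.5, -1.0, 2.0]
weights = [0.2, 0.1, 0.4]
score = stop_head_score(state, weights)
print(f"quality score: {score:.3f}")
```

A higher score would mark the prefix as more promising; where to set a pruning threshold is left open here.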
+
+ ## Training
+
+ The model is trained using prefix–potential supervision constructed via Monte Carlo rollouts.
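
One plausible reading of Monte Carlo supervision is sketched below: the potential of a prefix is estimated as the fraction of sampled completions that end in a correct answer. This definition is an assumption, since the README does not spell out how the potential is computed, and `estimate_potential`, `rollout_fn`, and `is_correct` are hypothetical names standing in for the actual sampling and grading code.

```python
import random


def estimate_potential(prefix, rollout_fn, is_correct, num_rollouts=8, seed=0):
    """Estimate a prefix's potential as its empirical rollout success rate.

    rollout_fn(prefix, rng) -> completion   (hypothetical sampler)
    is_correct(completion)  -> bool         (hypothetical grader)
    """
    if num_rollouts <= 0:
        raise ValueError("num_rollouts must be positive")
    rng = random.Random(seed)  # seeded so the estimate is reproducible
    successes = sum(
        1 for _ in range(num_rollouts) if is_correct(rollout_fn(prefix, rng))
    )
    return successes / num_rollouts


def toy_rollout(prefix, rng):
    # Toy stand-in for sampling from the reasoning model: prefixes that
    # contain "good" reach the answer "42" more often than others.
    p_success = 0.75 if "good" in prefix else 0.1
    return "42" if rng.random() < p_success else "0"


label = estimate_potential("a good start", toy_rollout, lambda c: c == "42")
print(f"estimated potential: {label:.2f}")
```

Each (prefix, estimated potential) pair could then serve as one training example for the scoring module.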
+
+ ## Usage
+
+ After generating prefixes, STOP can be used to:
+
+ 1. Score each prefix
+ 2. Select top-k candidates
+ 3. Resume generation only on selected paths
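
The three steps above can be sketched as a small selection routine. This is only an illustration: `select_top_k` is a hypothetical helper, the scores are made-up numbers, and the actual scoring and resumption calls depend on the inference stack in use.

```python
import heapq


def select_top_k(prefixes, scores, k):
    """Return the k prefixes with the highest scores, best first."""
    if len(prefixes) != len(scores):
        raise ValueError("prefixes and scores must have the same length")
    if k < 0:
        raise ValueError("k must be non-negative")
    # Pair each score with its index so the prefixes can be recovered.
    ranked = heapq.nlargest(k, zip(scores, range(len(prefixes))), key=lambda p: p[0])
    return [prefixes[i] for _, i in ranked]


# Step 1: scores would come from STOP; these values are made up.
prefixes = ["path A", "path B", "path C", "path D"]
scores = [0.2, 0.9, 0.5, 0.7]

# Step 2: keep the two most promising prefixes.
kept = select_top_k(prefixes, scores, k=2)

# Step 3: only the kept prefixes would be passed back to the generator.
print(kept)
```

Choosing `k` trades compute for coverage: a smaller `k` prunes more paths but risks discarding a prefix that would have succeeded.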
+
+ ## Results
+
+ - Significant token reduction (up to 70%)
+ - Improved reasoning accuracy
+ - Strong performance in tool-use settings (AIMO3)
+
+ ## Citation