Commit 4e22bee (verified) by arieldeng · 1 parent: a02de28

Update README.md

Files changed (1)
  1. README.md +6 -11
README.md CHANGED
@@ -41,22 +41,17 @@ HIPO has two main components:

  # Experimental Findings

- **Think-on Only Training (Overthinking).**
- Training the model solely on Think-on data causes it to reason on all problems, regardless of difficulty — a typical case of *overthinking*.
+ **Think-on Only (Overthinking).**
+ Training only on Think-on data makes the model reason on all problems, causing inefficiency.

- **GRPO on Cold-Start(on).**
- Applying GRPO improves accuracy by **+3.1%**, but fails to reduce token length or thinking rate. Instead, token length on simpler datasets even increases to achieve higher accuracy.
+ **GRPO.**
+ Improves accuracy by **+3.1%**, but increases token length on simple tasks.

  **Think-on/Think-off Mix.**
- Training on a mixed dataset boosts accuracy by **+4.0%** compared to Cold-Start(on), while significantly reducing token length (**–10.8%**) and thinking rate (**–22%**). Adding GRPO here brings little additional gain.
+ Yields higher accuracy (**+4.0%**) while reducing token length (**–10.8%**) and thinking rate (**–22%**).

  **HiPO Advantage.**
- With HiPO, the Cold-Start model achieves the best performance:
- - **Accuracy: +6.2%**
- - **Token length: –30%**
- - **Thinking rate: –39%**
-
- Overall, HiPO outperforms existing methods in both **efficiency** and **accuracy**.
+ Achieves the best results: **+6.2% accuracy**, **–30% token length**, **–39% thinking rate**, outperforming existing methods in both **efficiency** and **accuracy**.

  ![Kim 2025-09-26 145349](https://cdn-uploads.huggingface.co/production/uploads/61ee40a269351366e29972ad/_qzxhMRTL_NTfaGb13LHc.png)