HasuerYu commited on
Commit
afdef67
·
verified ·
1 Parent(s): 77ab492

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -2
README.md CHANGED
@@ -70,7 +70,7 @@ model-index:
70
 
71
  > **KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance**
72
 
73
- [![arXiv](https://img.shields.io/badge/arXiv-2505.XXXXX-b31b1b.svg)](https://arxiv.org/abs/2505.XXXXX)
74
  [![GitHub](https://img.shields.io/badge/💻%20GitHub-KnowRL-black)](https://github.com/HasuerYu/KnowRL)
75
  [![Collection](https://img.shields.io/badge/🤗%20HuggingFace-Collection-yellow)](https://huggingface.co/collections/HasuerYu/knowrl)
76
  [![Training Data](https://img.shields.io/badge/🤗%20HuggingFace-Training%20Data-yellow)](https://huggingface.co/datasets/HasuerYu/KnowRL-Train-Data)
@@ -193,7 +193,15 @@ An **entropy annealing** strategy is applied: after step 2,590, the clip upper b
193
  If you find this model helpful, please cite:
194
 
195
  ```bibtex
196
-
 
 
 
 
 
 
 
 
197
  ```
198
 
199
  ## License
 
70
 
71
  > **KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance**
72
 
73
+ [![arXiv](https://img.shields.io/badge/2604.12627-b31b1b.svg)](https://arxiv.org/abs/2604.12627)
74
  [![GitHub](https://img.shields.io/badge/💻%20GitHub-KnowRL-black)](https://github.com/HasuerYu/KnowRL)
75
  [![Collection](https://img.shields.io/badge/🤗%20HuggingFace-Collection-yellow)](https://huggingface.co/collections/HasuerYu/knowrl)
76
  [![Training Data](https://img.shields.io/badge/🤗%20HuggingFace-Training%20Data-yellow)](https://huggingface.co/datasets/HasuerYu/KnowRL-Train-Data)
 
193
  If you find this model helpful, please cite:
194
 
195
  ```bibtex
196
+ @misc{yu2026knowrlboostingllmreasoning,
197
+ title={KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance},
198
+ author={Linhao Yu and Tianmeng Yang and Siyu Ding and Renren Jin and Naibin Gu and Xiangzhao Hao and Shuaiyi Nie and Deyi Xiong and Weichong Yin and Yu Sun and Hua Wu},
199
+ year={2026},
200
+ eprint={2604.12627},
201
+ archivePrefix={arXiv},
202
+ primaryClass={cs.AI},
203
+ url={https://arxiv.org/abs/2604.12627},
204
+ }
205
  ```
206
 
207
  ## License