Update README.md
Browse files
README.md
CHANGED
|
@@ -70,7 +70,7 @@ model-index:
|
|
| 70 |
|
| 71 |
> **KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance**
|
| 72 |
|
| 73 |
-
[](https://github.com/HasuerYu/KnowRL)
|
| 75 |
[](https://huggingface.co/collections/HasuerYu/knowrl)
|
| 76 |
[](https://huggingface.co/datasets/HasuerYu/KnowRL-Train-Data)
|
|
@@ -193,7 +193,15 @@ An **entropy annealing** strategy is applied: after step 2,590, the clip upper b
|
|
| 193 |
If you find this model helpful, please cite:
|
| 194 |
|
| 195 |
```bibtex
|
| 196 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 197 |
```
|
| 198 |
|
| 199 |
## License
|
|
|
|
| 70 |
|
| 71 |
> **KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance**
|
| 72 |
|
| 73 |
+
[](https://arxiv.org/abs/2604.12627)
|
| 74 |
[](https://github.com/HasuerYu/KnowRL)
|
| 75 |
[](https://huggingface.co/collections/HasuerYu/knowrl)
|
| 76 |
[](https://huggingface.co/datasets/HasuerYu/KnowRL-Train-Data)
|
|
|
|
| 193 |
If you find this model helpful, please cite:
|
| 194 |
|
| 195 |
```bibtex
|
| 196 |
+
@misc{yu2026knowrlboostingllmreasoning,
|
| 197 |
+
title={KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance},
|
| 198 |
+
author={Linhao Yu and Tianmeng Yang and Siyu Ding and Renren Jin and Naibin Gu and Xiangzhao Hao and Shuaiyi Nie and Deyi Xiong and Weichong Yin and Yu Sun and Hua Wu},
|
| 199 |
+
year={2026},
|
| 200 |
+
eprint={2604.12627},
|
| 201 |
+
archivePrefix={arXiv},
|
| 202 |
+
primaryClass={cs.AI},
|
| 203 |
+
url={https://arxiv.org/abs/2604.12627},
|
| 204 |
+
}
|
| 205 |
```
|
| 206 |
|
| 207 |
## License
|