Update README.md
Browse files
README.md
CHANGED
|
@@ -102,4 +102,18 @@ YOUR_MODEL_PATH="<your_model_path>"
|
|
| 102 |
CKPTS_SAVE_DIR="<ckpts_save_path>"
|
| 103 |
YOUR_TRAIN_FILE="<train_data_path>"
|
| 104 |
YOUR_TEST_FILE="<test_data_path>"
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 105 |
```
|
|
|
|
| 102 |
CKPTS_SAVE_DIR="<ckpts_save_path>"
|
| 103 |
YOUR_TRAIN_FILE="<train_data_path>"
|
| 104 |
YOUR_TEST_FILE="<test_data_path>"
|
| 105 |
+
```
|
| 106 |
+
|
| 107 |
+
## 🤝 Citation
|
| 108 |
+
If you find this work helpful, please cite our paper:
|
| 109 |
+
```bibtex
|
| 110 |
+
@misc{su2025klearreasoneradvancingreasoningcapability,
|
| 111 |
+
title={Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization},
|
| 112 |
+
author={Zhenpeng Su and Leiyu Pan and Xue Bai and Dening Liu and Guanting Dong and Jiaming Huang and Wenping Hu and Guorui Zhou},
|
| 113 |
+
year={2025},
|
| 114 |
+
eprint={2508.07629},
|
| 115 |
+
archivePrefix={arXiv},
|
| 116 |
+
primaryClass={cs.LG},
|
| 117 |
+
url={https://arxiv.org/abs/2508.07629},
|
| 118 |
+
}
|
| 119 |
```
|