Update README.md
Browse files
README.md
CHANGED
|
@@ -13,5 +13,10 @@ Github: https://github.com/TheRoadQaQ/ReLIFT
|
|
| 13 |
# Citation
|
| 14 |
If you find our model, data, or evaluation code useful, please kindly cite our paper:
|
| 15 |
```bib
|
| 16 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 17 |
```
|
|
|
|
| 13 |
# Citation
|
| 14 |
If you find our model, data, or evaluation code useful, please kindly cite our paper:
|
| 15 |
```bib
|
| 16 |
+
@article{ma2025learning,
|
| 17 |
+
title={Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions},
|
| 18 |
+
author={Ma, Lu and Liang, Hao and Qiang, Meiyi and Tang, Lexiang and Ma, Xiaochen and Wong, Zhen Hao and Niu, Junbo and Shen, Chengyu and He, Runming and Cui, Bin and others},
|
| 19 |
+
journal={arXiv preprint arXiv:2506.07527},
|
| 20 |
+
year={2025}
|
| 21 |
+
}
|
| 22 |
```
|