RoadQAQ committed on
Commit 26e717f · verified · 1 Parent(s): 8e84b27

Update README.md

Files changed (1)
  1. README.md +6 -1
README.md CHANGED
@@ -13,5 +13,10 @@ Github: https://github.com/TheRoadQaQ/ReLIFT
  # Citation
  If you find our model, data, or evaluation code useful, please kindly cite our paper:
  ```bib
-
+ @article{ma2025learning,
+ title={Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions},
+ author={Ma, Lu and Liang, Hao and Qiang, Meiyi and Tang, Lexiang and Ma, Xiaochen and Wong, Zhen Hao and Niu, Junbo and Shen, Chengyu and He, Runming and Cui, Bin and others},
+ journal={arXiv preprint arXiv:2506.07527},
+ year={2025}
+ }
  ```