RoadQAQ
/

Qwen2.5-Math-7B-16k-think

Text Generation

text-generation-inference

Model card Files Files and versions

RoadQAQ commited on Jun 18, 2025

Commit

26e717f

·

verified ·

1 Parent(s): 8e84b27

Update README.md

Files changed (1) hide show

README.md +6 -1

README.md CHANGED Viewed

@@ -13,5 +13,10 @@ Github: https://github.com/TheRoadQaQ/ReLIFT
 # Citation
 If you find our model, data, or evaluation code useful, please kindly cite our paper:
 ```bib
 ```

 # Citation
 If you find our model, data, or evaluation code useful, please kindly cite our paper:
 ```bib
+@article{ma2025learning,
+  title={Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions},
+  author={Ma, Lu and Liang, Hao and Qiang, Meiyi and Tang, Lexiang and Ma, Xiaochen and Wong, Zhen Hao and Niu, Junbo and Shen, Chengyu and He, Runming and Cui, Bin and others},
+  journal={arXiv preprint arXiv:2506.07527},
+  year={2025}
+}
 ```