JingHaoZ
/

RLFR-Qwen2.5-Math-7B

Text Generation

text-generation-inference

Model card Files Files and versions

JingHaoZ commited on Oct 14, 2025

Commit

680ef41

·

verified ·

1 Parent(s): 1b1707f

Update README.md

Files changed (1) hide show

README.md +8 -3

README.md CHANGED Viewed

@@ -64,10 +64,15 @@ generated_ids = [
 response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
 ```
-<!-- ## Citation
 If you find our work helpful, feel free to give us a citation.
 ```
-``` -->

 response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
 ```
+## Citation
 If you find our work helpful, feel free to give us a citation.
 ```
+@article{zhang2025rlfr,
+  title={RLFR: Extending Reinforcement Learning for LLMs with Flow Environment},
+  author={Zhang, Jinghao and Zheng, Naishan and Li, Ruilin and Cheng, Dongzhou and Liang, Zheming and Zhao, Feng and Wang, Jiaqi},
+  journal={arXiv preprint arXiv:2510.10201},
+  year={2025}
+}
+```