YannQi/R-4B

Image-Text-to-Text

feature-extraction

YannQi committed on Sep 1, 2025

Commit d1b06a5 · verified · 1 Parent(s): 5cc3f24

Update README.md

Files changed (1)

README.md +10 -2


@@ -8,7 +8,7 @@ pipeline_tag: visual-question-answering
 ---
 # R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
-[[📚 Arxiv Paper (Coming soon)](https://huggingface.co/YannQi/R-4B)] [[🤗 Hugging Face](https://huggingface.co/YannQi/R-4B)]  [[🤖️ ModelScope](https://huggingface.co/YannQi/R-4B)] [[💻 Code](https://github.com/yannqi/R-4B)]
+[[📚 Arxiv Paper](https://arxiv.org/pdf/2508.21113)] [[🤗 Hugging Face](https://huggingface.co/YannQi/R-4B)]  [[🤖️ ModelScope](https://huggingface.co/YannQi/R-4B)] [[💻 Code](https://github.com/yannqi/R-4B)]
 <div align="center">
 <img src="asset/logo_R_4B.png" alt="logo" width="38" />
@@ -211,7 +211,15 @@ print("Chat response:", chat_response)
 ## ✒️ Citation
-Coming soon!
+@misc{jiang2025r4bincentivizinggeneralpurposeautothinking,
+      title={R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning},
+      author={Jie Jiang and Qi Yang and Bolin Ni and Shiming Xiang and Han Hu and Houwen Peng},
+      year={2025},
+      eprint={2508.21113},
+      archivePrefix={arXiv},
+      primaryClass={cs.CV},
+      url={https://arxiv.org/abs/2508.21113},
+}
 ## Acknowledgements