YannQi commited on
Commit
d1b06a5
·
verified ·
1 Parent(s): 5cc3f24

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -2
README.md CHANGED
@@ -8,7 +8,7 @@ pipeline_tag: visual-question-answering
8
  ---
9
  # R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
10
 
11
- [[📚 Arxiv Paper (Coming soon)](https://huggingface.co/YannQi/R-4B)] [[🤗 Hugging Face](https://huggingface.co/YannQi/R-4B)] [[🤖️ ModelScope](https://huggingface.co/YannQi/R-4B)] [[💻 Code](https://github.com/yannqi/R-4B)]
12
 
13
  <div align="center">
14
  <img src="asset/logo_R_4B.png" alt="logo" width="38" />
@@ -211,7 +211,15 @@ print("Chat response:", chat_response)
211
 
212
  ## ✒️ Citation
213
 
214
- Coming soon!
 
 
 
 
 
 
 
 
215
 
216
  ## Acknowledgements
217
 
 
8
  ---
9
  # R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
10
 
11
+ [[📚 Arxiv Paper](https://arxiv.org/pdf/2508.21113)] [[🤗 Hugging Face](https://huggingface.co/YannQi/R-4B)] [[🤖️ ModelScope](https://huggingface.co/YannQi/R-4B)] [[💻 Code](https://github.com/yannqi/R-4B)]
12
 
13
  <div align="center">
14
  <img src="asset/logo_R_4B.png" alt="logo" width="38" />
 
211
 
212
  ## ✒️ Citation
213
 
214
+ @misc{jiang2025r4bincentivizinggeneralpurposeautothinking,
215
+ title={R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning},
216
+ author={Jie Jiang and Qi Yang and Bolin Ni and Shiming Xiang and Han Hu and Houwen Peng},
217
+ year={2025},
218
+ eprint={2508.21113},
219
+ archivePrefix={arXiv},
220
+ primaryClass={cs.CV},
221
+ url={https://arxiv.org/abs/2508.21113},
222
+ }
223
 
224
  ## Acknowledgements
225