Update README.md
Browse files
README.md
CHANGED
|
@@ -8,7 +8,7 @@ pipeline_tag: visual-question-answering
|
|
| 8 |
---
|
| 9 |
# R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
|
| 10 |
|
| 11 |
-
[[📚 Arxiv Paper
|
| 12 |
|
| 13 |
<div align="center">
|
| 14 |
<img src="asset/logo_R_4B.png" alt="logo" width="38" />
|
|
@@ -211,7 +211,15 @@ print("Chat response:", chat_response)
|
|
| 211 |
|
| 212 |
## ✒️ Citation
|
| 213 |
|
| 214 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 215 |
|
| 216 |
## Acknowledgements
|
| 217 |
|
|
|
|
| 8 |
---
|
| 9 |
# R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning
|
| 10 |
|
| 11 |
+
[[📚 Arxiv Paper](https://arxiv.org/pdf/2508.21113)] [[🤗 Hugging Face](https://huggingface.co/YannQi/R-4B)] [[🤖️ ModelScope](https://huggingface.co/YannQi/R-4B)] [[💻 Code](https://github.com/yannqi/R-4B)]
|
| 12 |
|
| 13 |
<div align="center">
|
| 14 |
<img src="asset/logo_R_4B.png" alt="logo" width="38" />
|
|
|
|
| 211 |
|
| 212 |
## ✒️ Citation
|
| 213 |
|
| 214 |
+
@misc{jiang2025r4bincentivizinggeneralpurposeautothinking,
|
| 215 |
+
title={R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning},
|
| 216 |
+
author={Jie Jiang and Qi Yang and Bolin Ni and Shiming Xiang and Han Hu and Houwen Peng},
|
| 217 |
+
year={2025},
|
| 218 |
+
eprint={2508.21113},
|
| 219 |
+
archivePrefix={arXiv},
|
| 220 |
+
primaryClass={cs.CV},
|
| 221 |
+
url={https://arxiv.org/abs/2508.21113},
|
| 222 |
+
}
|
| 223 |
|
| 224 |
## Acknowledgements
|
| 225 |
|