Update README.md
Browse files
README.md
CHANGED
|
@@ -4,4 +4,27 @@ base_model:
|
|
| 4 |
tags:
|
| 5 |
- multimodel
|
| 6 |
- reasoning
|
| 7 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4 |
tags:
|
| 5 |
- multimodel
|
| 6 |
- reasoning
|
| 7 |
+
---
|
| 8 |
+
|
| 9 |
+
|
| 10 |
+
### Model Sources
|
| 11 |
+
|
| 12 |
+
<!-- Provide the basic links for the model. -->
|
| 13 |
+
|
| 14 |
+
- **Repository:** https://github.com/maifoundations/Visionary-R1
|
| 15 |
+
- **Paper:** https://arxiv.org/pdf/2505.14677
|
| 16 |
+
- **Blog:** https://www.maifoundations.com/blog/visionary-r1/
|
| 17 |
+
|
| 18 |
+
## Uses
|
| 19 |
+
You can follow the instruction of [Qwen2.5-VL](https://huggingface.co/Qwen/Qwen2.5-VL-3B-Instruct) to use the checkpoints.
|
| 20 |
+
|
| 21 |
+
## Citation
|
| 22 |
+
|
| 23 |
+
```
|
| 24 |
+
@article{xia2025visionary,
|
| 25 |
+
title={Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning},
|
| 26 |
+
author={Xia, Jiaer and Zang, Yuhang and Gao, Peng and Li, Yixuan and Zhou, Kaiyang},
|
| 27 |
+
journal={arXiv preprint arXiv:2505.14677},
|
| 28 |
+
year={2025}
|
| 29 |
+
}
|
| 30 |
+
```
|