Visionary-R1 / README.md
JiaerX's picture
Update README.md
0dfdb5c verified
|
raw
history blame
747 Bytes
metadata
base_model:
  - Qwen/Qwen2.5-VL-3B-Instruct
tags:
  - multimodel
  - reasoning

Model Sources

Uses

You can follow the instruction of Qwen2.5-VL to use the checkpoints.

Citation

@article{xia2025visionary,
  title={Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning},
  author={Xia, Jiaer and Zang, Yuhang and Gao, Peng and Li, Yixuan and Zhou, Kaiyang},
  journal={arXiv preprint arXiv:2505.14677},
  year={2025}
}