| base_model: | |
| - Qwen/Qwen2.5-VL-3B-Instruct | |
| datasets: | |
| - WaltonFuture/Multimodal-Cold-Start | |
| - WaltonFuture/Multimodal-RL-Data | |
| license: apache-2.0 | |
| pipeline_tag: image-text-to-text | |
| library_name: transformers | |
| * ๐ **GitHub Repo:** [waltonfuture/RL-with-Cold-Start](https://github.com/waltonfuture/RL-with-Cold-Start) | |
| * ๐ **Paper (arXiv):** [Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start (arXiv:2505.22334)](https://arxiv.org/abs/2505.22334) |