Image-Text-to-Text
Transformers
Safetensors
qwen2_5_vl
conversational
text-generation-inference
File size: 476 Bytes
6ccb528
40282cc
 
6ccb528
 
 
40282cc
 
 
4e9a935
40282cc
4e9a935
 
1
2
3
4
5
6
7
8
9
10
11
12
13
---
base_model:
- Qwen/Qwen2.5-VL-3B-Instruct
datasets:
- WaltonFuture/Multimodal-Cold-Start
- WaltonFuture/Multimodal-RL-Data
license: apache-2.0
pipeline_tag: image-text-to-text
library_name: transformers
---

* 🐙 **GitHub Repo:** [waltonfuture/RL-with-Cold-Start](https://github.com/waltonfuture/RL-with-Cold-Start)
* 📜 **Paper (arXiv):** [Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start (arXiv:2505.22334)](https://arxiv.org/abs/2505.22334)