Qwen2.5VL-3b-RLCS / README.md

Add library name and pipeline tag to the model card

40282cc verified 9 months ago

476 Bytes

	---
	base_model:
	- Qwen/Qwen2.5-VL-3B-Instruct
	datasets:
	- WaltonFuture/Multimodal-Cold-Start
	- WaltonFuture/Multimodal-RL-Data
	license: apache-2.0
	pipeline_tag: image-text-to-text
	library_name: transformers
	---

	* 🐙 GitHub Repo: [waltonfuture/RL-with-Cold-Start](https://github.com/waltonfuture/RL-with-Cold-Start)
	* 📜 Paper (arXiv): [Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start (arXiv:2505.22334)](https://arxiv.org/abs/2505.22334)