Update README.md
Browse files
README.md
CHANGED
|
@@ -5,4 +5,6 @@ datasets:
|
|
| 5 |
- WaltonFuture/Multimodal-RL-Data
|
| 6 |
base_model:
|
| 7 |
- Qwen/Qwen2.5-VL-3B-Instruct
|
| 8 |
-
---
|
|
|
|
|
|
|
|
|
| 5 |
- WaltonFuture/Multimodal-RL-Data
|
| 6 |
base_model:
|
| 7 |
- Qwen/Qwen2.5-VL-3B-Instruct
|
| 8 |
+
---
|
| 9 |
+
* 🐙 **GitHub Repo:** [waltonfuture/RL-with-Cold-Start](https://github.com/waltonfuture/RL-with-Cold-Start)
|
| 10 |
+
* 📜 **Paper (arXiv):** [Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start (arXiv:2505.22334)](https://arxiv.org/abs/2505.22334)
|