WaltonFuture committed on
Commit 4e9a935 · verified · 1 Parent(s): 6ccb528

Update README.md

  1. README.md +3 -1
README.md CHANGED
@@ -5,4 +5,6 @@ datasets:
  - WaltonFuture/Multimodal-RL-Data
  base_model:
  - Qwen/Qwen2.5-VL-3B-Instruct
- ---
+ ---
+ * 🐙 **GitHub Repo:** [waltonfuture/RL-with-Cold-Start](https://github.com/waltonfuture/RL-with-Cold-Start)
+ * 📜 **Paper (arXiv):** [Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start (arXiv:2505.22334)](https://arxiv.org/abs/2505.22334)