mikeogezi/Qwen2-VL-2B-GRPO-MMR-TrainedRationaleVerifier Image-to-Text • 2B • Updated Mar 20, 2025 • 18