PC-GRPO Collection Qwen2.5-VL-3B & 7B models trained with PC-GRPO in the paper: Puzzle Curriculum GRPO for Vision-Centric Reasoning • 9 items • Updated 2 days ago • 3