armenjeddi/PCGRPO-Qwen2.5-VL-3B-Jigsaw-Base-plus-curriculum-plus-CARE
4B
•
Updated
•
23
Qwen2.5-VL-3B & 7B models trained with PC-GRPO in the paper: Puzzle Curriculum GRPO for Vision-Centric Reasoning