view article Article From GRPO to DAPO and GSPO: What, Why, and How NormalUhr • Aug 9, 2025 • 119
Perception-Aware Policy Optimization for Multimodal Reasoning Paper • 2507.06448 • Published Jul 8, 2025 • 48