Graph-GRPO: Stabilizing Multi-Agent Topology Learning via Group Relative Policy Optimization
Paper • 2603.02701 • Published Mar 3