DenseGRPO: From Sparse to Dense Reward for Flow Matching Model Alignment Paper • 2601.20218 • Published Jan 28 • 16
End-to-End Video Character Replacement without Structural Guidance Paper • 2601.08587 • Published Jan 13 • 8
GUI Exploration Lab: Enhancing Screen Navigation in Agents via Multi-Turn Reinforcement Learning Paper • 2512.02423 • Published Dec 2, 2025 • 5