FlowRL: A Taxonomy and Modular Framework for Reinforcement Learning with Diffusion Policies Paper • 2603.27450 • Published Mar 29
Diffusion Reinforcement Learning via Centered Reward Distillation Paper • 2603.14128 • Published Mar 14
UDM-GRPO: Stable and Efficient Group Relative Policy Optimization for Uniform Discrete Diffusion Models Paper • 2604.18518 • Published Apr 20
GoRL: An Algorithm-Agnostic Framework for Online Reinforcement Learning with Generative Policies Paper • 2512.02581 • Published Dec 2, 2025