Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models Paper • 2606.11025 • Published 2 days ago • 32
Exploring the Design Space of Reward Backpropagation for Flow Matching Paper • 2606.11075 • Published 1 day ago • 1