Submitted by
Tianyu Pang
AI & ML interests
None defined yet.
Recent Activity
Papers
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models
Rethinking the Divergence Regularization in LLM RL