D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models
Paper • 2605.05204 • Published • 27
SeeUPO: Sequence-Level Agentic-RL with Convergence Guarantees