TDM-R1: Reinforcing Few-Step Diffusion Models with Non-Differentiable Reward Paper • 2603.07700 • Published 3 days ago • 12
ViewFusion: Structured Spatial Thinking Chains for Multi-View Reasoning Paper • 2603.06024 • Published 5 days ago • 5
Reinforcing Diffusion Models by Direct Group Preference Optimization Paper • 2510.08425 • Published Oct 9, 2025 • 12