Elucidating the SNR-t Bias of Diffusion Probabilistic Models Paper • 2604.16044 • Published 20 days ago • 74
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published 15 days ago • 239
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published 29 days ago • 323