DiffusionBench: On Holistic Evaluation of Diffusion Transformers Paper • 2606.24888 • Published 3 days ago • 9
UniDDT: Unifying Multimodal Understanding and Generation with Decoupled Diffusion Transformer Paper • 2606.16255 • Published 11 days ago • 14
RepFusion: Leveraging Multimodal Priors for Denoising in Representation Space Paper • 2606.14700 • Published 14 days ago • 18
HYDRA-X: Native Unified Multimodal Models with Holistic Visual Tokenizers Paper • 2606.13289 • Published 15 days ago • 29
Representation Forcing for Bottleneck-Free Unified Multimodal Models Paper • 2605.31604 • Published 28 days ago • 61
PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion Paper • 2605.23902 • Published May 22 • 46
SNLP: Layer-Parallel Inference via Structured Newton Corrections Paper • 2605.17842 • Published May 18 • 4
AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation Paper • 2605.13724 • Published May 13 • 105
Strips as Tokens: Artist Mesh Generation with Native UV Segmentation Paper • 2604.09132 • Published Apr 10 • 56
VGGRPO: Towards World-Consistent Video Generation with 4D Latent Reward Paper • 2603.26599 • Published Mar 27 • 67
UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation Paper • 2603.23500 • Published Mar 24 • 36