Qwen-AgentWorld: Language World Models for General Agents Paper • 2606.24597 • Published 11 days ago • 144
DiffusionBench: On Holistic Evaluation of Diffusion Transformers Paper • 2606.24888 • Published 11 days ago • 11
Are Text-to-Image Models Inductivist Turkeys? A Counterfactual Benchmark for Causal Reasoning Paper • 2606.24548 • Published 11 days ago • 11
Sumi: Open Uniform Diffusion Language Model from Scratch Paper • 2606.19005 • Published 17 days ago • 11
PEFT-Arena: Understanding Parameter-Efficient Finetuning from a Stability-Plasticity Perspective Paper • 2605.28819 • Published May 27 • 8
Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models Paper • 2605.21573 • Published May 20 • 111
You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories Paper • 2605.21468 • Published May 20 • 51
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture Paper • 2605.12500 • Published May 12 • 194
NEO-unify: Building Native Multimodal Unified Models End to End Article • sensenova • Mar 5 • 167
D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models Paper • 2605.05204 • Published May 6 • 28
OceanPile: A Large-Scale Multimodal Ocean Corpus for Foundation Models Paper • 2605.00877 • Published Apr 25 • 15
Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling Paper • 2604.28185 • Published Apr 30 • 92
Diffusion Templates: A Unified Plugin Framework for Controllable Diffusion Paper • 2604.24351 • Published Apr 27 • 11
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published Apr 22 • 244
Understanding and Enforcing Weight Disentanglement in Task Arithmetic Paper • 2604.17078 • Published Apr 18 • 14
Elucidating the SNR-t Bias of Diffusion Probabilistic Models Paper • 2604.16044 • Published Apr 17 • 73