Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling Paper • 2604.28185 • Published 17 days ago • 90
Refinement via Regeneration: Enlarging Modification Space Boosts Image Refinement in Unified Multimodal Models Paper • 2604.25636 • Published 19 days ago • 24
ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners? Paper • 2603.25823 • Published Mar 26 • 43
HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning Paper • 2603.17024 • Published Mar 17 • 109
The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models Paper • 2601.15165 • Published Jan 21 • 74
Few-Step Distillation for Text-to-Image Generation: A Practical Guide Paper • 2512.13006 • Published Dec 15, 2025 • 10
Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals Paper • 2510.27684 • Published Oct 31, 2025 • 23
IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance Paper • 2509.26231 • Published Sep 30, 2025 • 18
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published Jun 2, 2025 • 190
UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents Paper • 2505.21496 • Published May 27, 2025 • 38
Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published May 6, 2025 • 192
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Paper • 2504.13837 • Published Apr 18, 2025 • 141
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models Paper • 2503.10437 • Published Mar 13, 2025 • 34
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation Paper • 2502.18364 • Published Feb 25, 2025 • 36
Cosmos Collection ⚠️ This collection is archived. https://huggingface.co/collections/nvidia/nvidia-cosmos-2 • 14 items • Updated 8 days ago • 302
LLM-based Optimization of Compound AI Systems: A Survey Paper • 2410.16392 • Published Oct 21, 2024 • 16
Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators Paper • 2408.05710 • Published Aug 11, 2024 • 2