World Tracing: Generative Pixel-Aligned Geometry Beyond the Visible Paper • 2606.13652 • Published 15 days ago • 15
RepFusion: Leveraging Multimodal Priors for Denoising in Representation Space Paper • 2606.14700 • Published 14 days ago • 18
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players Paper • 2605.28816 • Published 30 days ago • 431
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture Paper • 2605.12500 • Published May 12 • 194
Stream-T1: Test-Time Scaling for Streaming Video Generation Paper • 2605.04461 • Published May 6 • 109
Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning Paper • 2604.04746 • Published Apr 8 • 73
SK-Adapter: Skeleton-Based Structural Control for Native 3D Generation Paper • 2603.14152 • Published Mar 14 • 7
XToM: Exploring the Multilingual Theory of Mind for Large Language Models Paper • 2506.02461 • Published Jun 3, 2025 • 3
ReasonNavi: Human-Inspired Global Map Reasoning for Zero-Shot Embodied Navigation Paper • 2602.15864 • Published Jan 26
SK-Adapter: Skeleton-Based Structural Control for Native 3D Generation Paper • 2603.14152 • Published Mar 14 • 7
SK-Adapter: Skeleton-Based Structural Control for Native 3D Generation Paper • 2603.14152 • Published Mar 14 • 7