Qwen-Image-Agent: Bridging the Context Gap in Real-World Image Generation Paper • 2606.26907 • Published 3 days ago • 41
HYDRA-X: Native Unified Multimodal Models with Holistic Visual Tokenizers Paper • 2606.13289 • Published 17 days ago • 29
Representation Forcing for Bottleneck-Free Unified Multimodal Models Paper • 2605.31604 • Published about 1 month ago • 63
view article Article NEO-unify: Building Native Multimodal Unified Models End to End sensenova • Mar 5 • 167
DrivingGen: A Comprehensive Benchmark for Generative Video World Models in Autonomous Driving Paper • 2601.01528 • Published Jan 4 • 19
Openpi Comet: Competition Solution For 2025 BEHAVIOR Challenge Paper • 2512.10071 • Published Dec 10, 2025 • 18
EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control Paper • 2508.21112 • Published Aug 28, 2025 • 78