SpatialTree: How Spatial Abilities Branch Out in MLLMs Paper • 2512.20617 • Published 2 days ago • 39
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling Paper • 2512.14614 • Published 9 days ago • 63
QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management Paper • 2512.12967 • Published 11 days ago • 98
Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics Paper • 2512.12602 • Published 11 days ago • 39
LEO-RobotAgent: A General-purpose Robotic Agent for Language-driven Embodied Operator Paper • 2512.10605 • Published 14 days ago • 6
Task adaptation of Vision-Language-Action model: 1st Place Solution for the 2025 BEHAVIOR Challenge Paper • 2512.06951 • Published 18 days ago • 3
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing Paper • 2509.22186 • Published Sep 26 • 139
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published 28 days ago • 212