JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence Paper • 2606.14777 • Published 7 days ago • 153
RhymeFlow: Training-Free Acceleration for Video Generation with Asynchronous Denoising Flow Scheduling Paper • 2606.06309 • Published 13 days ago • 7
MBench: A Comprehensive Benchmark on Memory Capability for Video World Models Paper • 2606.00793 • Published 9 days ago • 8
Solving Physics Olympiad via Reinforcement Learning on Physics Simulators Paper • 2604.11805 • Published Apr 13 • 16
SimRecon: SimReady Compositional Scene Reconstruction from Real Videos Paper • 2603.02133 • Published Mar 2 • 4
ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignment Paper • 2507.19058 • Published Jul 25, 2025 • 13
LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion Paper • 2507.02813 • Published Jul 3, 2025 • 60
VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step Paper • 2504.01956 • Published Apr 2, 2025 • 41