World Pilot: Steering Vision-Language-Action Models with World-Action Priors • Paper • 2606.12403 • Published 1 day ago • 20
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors • Paper • 2605.00658 • Published May 1 • 84