PhysisForcing: Physics Reinforced World Simulator for Robotic Manipulation Paper • 2606.28128 • Published 4 days ago • 31
HumanScale: Egocentric Human Video Can Outperform Real-Robot Data for Embodied Pretraining Paper • 2606.20521 • Published 12 days ago • 14
TS-Attn: Temporal-wise Separable Attention for Multi-Event Video Generation Paper • 2604.19473 • Published Apr 21
Fast-dDrive: Efficient Block-Diffusion VLM for Autonomous Driving Paper • 2605.23163 • Published May 25 • 17
StableVLA: Towards Robust Vision-Language-Action Models without Extra Data Paper • 2605.18287 • Published May 18 • 15
HumanNet: Scaling Human-centric Video Learning to One Million Hours Paper • 2605.06747 • Published May 7 • 55
I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models Paper • 2312.16693 • Published Dec 27, 2023 • 14
VideoTetris: Towards Compositional Text-to-Video Generation Paper • 2406.04277 • Published Jun 6, 2024 • 25
Enhancing Spatial Understanding in Image Generation via Reward Modeling Paper • 2602.24233 • Published Feb 27 • 60