Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis Paper • 2603.29620 • Published 2 days ago • 37
facebook/dinov3-vit7b16-pretrain-lvd1689m Image Feature Extraction • 7B • Updated Aug 19, 2025 • 30.8k • 221
Towards Physically Plausible Video Generation via VLM Planning Paper • 2503.23368 • Published Mar 30, 2025 • 40