MolmoMotion: Forecasting Point Trajectories in 3D with Language Instruction Paper • 2606.18558 • Published 16 days ago • 53
Affogato: Learning Open-Vocabulary Affordance Grounding with Automated Data Generation at Scale Paper • 2506.12009 • Published Jun 13, 2025 • 2
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding Paper • 2404.05726 • Published Apr 8, 2024 • 23
DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing Paper • 2306.14435 • Published Jun 26, 2023 • 21