MolmoMotion: Forecasting Point Trajectories in 3D with Language Instruction Paper • 2606.18558 • Published 10 days ago • 51
MolmoAct2 Eval Rollouts Collection Collection of the evaluation rollouts for MolmoAct2 conducted by Cortex AI • 61 items • Updated May 20 • 3
MolmoAct2 Models Collection Collection of the base models for MolmoAct2 • 6 items • Updated May 5 • 23
MolmoAct2 Finetuned Models Collection Collection of the fine-tuned models for MolmoAct2 • 7 items • Updated May 14 • 12
MolmoAct2 Datasets Collection Collection of robotics datasets for MolmoAct2 • 10 items • Updated 17 days ago • 13
MolmoAct2-BimanualYAM Dataset Collection Collection of the MolmoAct2-BimanualYAM Dataset • 741 items • Updated 17 days ago • 14
Molmo2-ER Datasets Collection Collection of the embodied reasoning datasets for MolmoAct2 • 11 items • Updated May 5 • 10
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published May 4 • 355
WildDet3D Collection This is the collection of WildDet3D artifacts, including demos, model checkpoints and data. https://github.com/allenai/WildDet3D • 8 items • Updated Apr 13 • 20
SAM2Act Collection Collection of the models, datasets, and benchmarks for SAM2Act • 5 items • Updated 28 days ago • 1
FailSafe: Reasoning and Recovery from Failures in Vision-Language-Action Models Paper • 2510.01642 • Published Oct 2, 2025 • 1
TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics Paper • 2602.19313 • Published Feb 22 • 26
Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning Paper • 2602.07845 • Published Feb 8 • 71
VLS: Steering Pretrained Robot Policies via Vision-Language Models Paper • 2602.03973 • Published Feb 3 • 22
MolmoAct Collection All models for the MolmoAct (Multimodal Open Language Model for Action) release. • 10 items • Updated May 4 • 37
MolmoAct Data Mixture Collection All datasets for the MolmoAct (Multimodal Open Language Model for Action) release. • 4 items • Updated Dec 23, 2025 • 20
MolmoAct: Action Reasoning Models that can Reason in Space Paper • 2508.07917 • Published Aug 11, 2025 • 45
SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation Paper • 2501.18564 • Published Jan 30, 2025 • 2
PointArena: Probing Multimodal Grounding Through Language-Guided Pointing Paper • 2505.09990 • Published May 15, 2025 • 12