MolmoAct2 Models • Collection • Collection of the base models for MolmoAct2 • 6 items • Updated 3 days ago • 13
MolmoAct2 Finetuned Models • Collection • Collection of the fine-tuned models for MolmoAct2 • 6 items • Updated 3 days ago • 4
MolmoAct2 Datasets • Collection • Collection of robotics datasets for MolmoAct2 • 8 items • Updated 3 days ago • 8
MolmoAct2-BimanualYAM Dataset • Collection • Collection of the MolmoAct2-BimanualYAM Dataset • 740 items • Updated 2 days ago • 11
Molmo2-ER Datasets • Collection • Collection of the embodied reasoning datasets for MolmoAct2 • 11 items • Updated 3 days ago • 6
MolmoAct2: Action Reasoning Models for Real-world Deployment • Paper • 2605.02881 • Published 4 days ago • 258
WildDet3D • Collection • This is the collection of WildDet3D artifacts, including demos, model checkpoints and data. https://github.com/allenai/WildDet3D • 8 items • Updated 25 days ago • 17
WildDet3D: Scaling Promptable 3D Detection in the Wild • Paper • 2604.08626 • Published 29 days ago • 245
SAM2Act • Collection • Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation • 3 items • Updated Mar 27 • 1
FailSafe: Reasoning and Recovery from Failures in Vision-Language-Action Models • Paper • 2510.01642 • Published Oct 2, 2025 • 1
TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics • Paper • 2602.19313 • Published Feb 22 • 26
Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning • Paper • 2602.07845 • Published Feb 8 • 71
VLS: Steering Pretrained Robot Policies via Vision-Language Models • Paper • 2602.03973 • Published Feb 3 • 22
MolmoAct • Collection • All models for the MolmoAct (Multimodal Open Language Model for Action) release. • 10 items • Updated 3 days ago • 35
MolmoAct Data Mixture • Collection • All datasets for the MolmoAct (Multimodal Open Language Model for Action) release. • 4 items • Updated Dec 23, 2025 • 18
MolmoAct: Action Reasoning Models that can Reason in Space • Paper • 2508.07917 • Published Aug 11, 2025 • 45
SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation • Paper • 2501.18564 • Published Jan 30, 2025 • 2
PointArena: Probing Multimodal Grounding Through Language-Guided Pointing • Paper • 2505.09990 • Published May 15, 2025 • 12