Crab SmolVLA โ€” Left Arm

Fine-tuned SmolVLA for left-arm (6-DOF) manipulation on the Crab bimanual mobile manipulator.

Model Details

  • Base model: lerobot/smolvla_base (450M params)
  • Action space: 6-DOF absolute joint positions (indices 0โ€“5)
  • Training data: 27 demonstrations across 3 tasks (eggs, can, waffles)
  • Best validation loss: 0.46
  • Training: 50K steps, RTX 4090, ~3.5 hrs

Usage

import torch
checkpoint = torch.load("best/model.pt", map_location="cpu")

See Advanced-Robotic-Manipulation/crab for full inference pipeline.

Training Config

Training configuration is provided in config.yaml. Key settings:

  • Image size: 256ร—256, 3 cameras
  • Data augmentation: ColorJitter + RandomResizedCrop
  • Optimizer: AdamW, lr=1e-4, weight_decay=0.01
  • Effective batch size: 32 (8 ร— 4 gradient accumulation)

Citation

If you use this model, please cite our paper (coming soon).

Downloads last month
-
Video Preview
loading

Model tree for armteam/crab-smolvla-left-arm

Finetuned
(4892)
this model