Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware
Paper
•
2304.13705
•
Published
•
6
A lightweight Action Chunking with Transformers (ACT) model trained on the ALOHA simulation Insertion task. This is a difficult bimanual coordination task with lower success rate compared to TransferCube.
| Property | Value |
|---|---|
| Architecture | ACT (Action Chunking with Transformers) |
| Parameters | 52M |
| Task | ALOHA Insertion-v0 |
| Training Steps | 200,000 |
| Batch Size | 32 |
| Success Rate | ~15% |
The Insertion task requires a bimanual robot to:
⚠️ This is a difficult task requiring precise bimanual coordination. Success rate is significantly lower than TransferCube.
pip install lerobot gym-aloha
lerobot-train \
--policy.type=act \
--dataset.repo_id=lerobot/aloha_sim_insertion_human_image \
--env.type=aloha \
--env.task=AlohaInsertion-v0 \
--batch_size=32 \
--steps=200000 \
--eval.n_episodes=10 \
--eval_freq=20000 \
--save_freq=20000 \
--output_dir=./outputs/act_aloha_insertion \
--wandb.enable=false \
--policy.push_to_hub=false
lerobot-eval \
--policy.path=LeTau/act_aloha_insertion \
--env.type=aloha \
--env.task=AlohaInsertion-v0 \
--eval.batch_size=1 \
--eval.n_episodes=20
lerobot-train \
--resume=true \
--config_path=LeTau/act_aloha_insertion/train_config.json \
--steps=300000
| Evaluation | Episodes | Success Rate | Avg Sum Reward |
|---|---|---|---|
| Training (120K) | 10 | 10% | 40.3 |
| Training (200K) | 10 | 20% | 40.4 |
| Independent | 20 | 15% | 51.2 |
Expected success rate: 15-20%
| Task | Difficulty | Success Rate |
|---|---|---|
| TransferCube | Easy | 35-42% |
| Insertion | Hard | 15-20% |
Sum Rewards: [0.0, 0.0, 0.0, 240.0, 121.0, 0.0, 0.0, 0.0, 43.0, 0.0,
256.0, 0.0, 0.0, 321.0, 0.0, 0.0, 0.0, 0.0, 43.0, 0.0]
Successes: 3/20 episodes
@article{zhao2023learning,
title={Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware},
author={Zhao, Tony Z and Kumar, Vikash and Levine, Sergey and Finn, Chelsea},
journal={arXiv preprint arXiv:2304.13705},
year={2023}
}