lerobot/svla_so101_pickplace
Viewer • Updated • 11.9k • 3.04k • 38
How to use y1y2y3/smolvla_base1 with LeRobot:
# See https://github.com/huggingface/lerobot?tab=readme-ov-file#installation for more details git clone https://github.com/huggingface/lerobot.git cd lerobot pip install -e .[smolvla]
# Launch finetuning on your dataset python lerobot/scripts/train.py \ --policy.path=y1y2y3/smolvla_base1 \ --dataset.repo_id=lerobot/svla_so101_pickplace \ --batch_size=64 \ --steps=20000 \ --output_dir=outputs/train/my_smolvla \ --job_name=my_smolvla_training \ --policy.device=cuda \ --wandb.enable=true
# Run the policy using the record function
python -m lerobot.record \
--robot.type=so101_follower \
--robot.port=/dev/ttyACM0 \ # <- Use your port
--robot.id=my_blue_follower_arm \ # <- Use your robot id
--robot.cameras="{ front: {type: opencv, index_or_path: 8, width: 640, height: 480, fps: 30}}" \ # <- Use your cameras
--dataset.single_task="Grasp a lego block and put it in the bin." \ # <- Use the same task description you used in your dataset recording
--dataset.repo_id=HF_USER/dataset_name \ # <- This will be the dataset name on HF Hub
--dataset.episode_time_s=50 \
--dataset.num_episodes=10 \
--policy.path=y1y2y3/smolvla_base1Resources and technical documentation:
Train using Google Colab Notebook
Designed by Hugging Face.
This model has 450M parameters in total. You can use inside the LeRobot library.
Before proceeding to the next steps, you need to properly install the environment by following Installation Guide on the docs.
Install smolvla extra dependencies:
pip install -e ".[smolvla]"
Example of finetuning the smolvla pretrained model (smolvla_base):
python lerobot/scripts/train.py \
--policy.path=lerobot/smolvla_base \
--dataset.repo_id=lerobot/svla_so101_pickplace \
--batch_size=64 \
--steps=20000 \
--output_dir=outputs/train/my_smolvla \
--job_name=my_smolvla_training \
--policy.device=cuda \
--wandb.enable=true
Example of finetuning the smolvla neural network with pretrained VLM and action expert intialized from scratch:
python lerobot/scripts/train.py \
--dataset.repo_id=lerobot/svla_so101_pickplace \
--batch_size=64 \
--steps=200000 \
--output_dir=outputs/train/my_smolvla \
--job_name=my_smolvla_training \
--policy.device=cuda \
--wandb.enable=true