YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
pi0-FAST Cleaning Fine-tune (LoRA)
Fine-tuning pi0-FAST (2.9B) on YouTube egocentric cleaning videos using LoRA.
Pipeline
- YouTube egocentric cleaning videos โ frame extraction
- HaMeR (3D hand keypoints + MANO) โ hand trajectory extraction
- Franka IK retargeting (smoothed + velocity-clipped)
- VLM labeling (action/object/contact)
- LeRobot HDF5 dataset (361 episodes, 72k timesteps)
- pi0-FAST LoRA fine-tuning
Training Config
- Base model:
lerobot/pi0fast-base(2.9B params) - LoRA: rank=32, alpha=64, targets=q/k/v/o_proj (13.3M trainable, 0.45%)
- Batch size: 4 ร 8 grad_accum = 32 effective
- LR: 1e-4 cosine with 200 step warmup
- Dataset: 361 episodes, 72,311 timesteps
- Action: 7 Franka joints + 1 gripper (padded to 32)
- State: 3 EE position + 1 gripper (padded to 32)
Training Progress (Epoch 1, interrupted)
- Loss: 40.5 โ 6.4 (16,600/18,077 steps, ~92% of epoch 1)
- Time: ~91 minutes on A100-40GB
- Peak GPU: 29.6GB
Dataset
Dataset on Google Drive (pi0fast_dataset.hdf5, 6GB):
- 361 episodes from 2 cleaning videos
- 224x224 images + smoothed Franka joint trajectories
- Language instructions from VLM labeling
Resume Training
pip install lerobot h5py peft scipy
python3 train_pi0fast.py
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support