Kavin60606
/

pi0fast-cleaning-finetune

Model card Files Files and versions

YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

pi0-FAST Cleaning Fine-tune (LoRA)

Fine-tuning pi0-FAST (2.9B) on YouTube egocentric cleaning videos using LoRA.

Pipeline

YouTube egocentric cleaning videos → frame extraction
HaMeR (3D hand keypoints + MANO) → hand trajectory extraction
Franka IK retargeting (smoothed + velocity-clipped)
VLM labeling (action/object/contact)
LeRobot HDF5 dataset (361 episodes, 72k timesteps)
pi0-FAST LoRA fine-tuning

Training Config

Base model: lerobot/pi0fast-base (2.9B params)
LoRA: rank=32, alpha=64, targets=q/k/v/o_proj (13.3M trainable, 0.45%)
Batch size: 4 × 8 grad_accum = 32 effective
LR: 1e-4 cosine with 200 step warmup
Dataset: 361 episodes, 72,311 timesteps
Action: 7 Franka joints + 1 gripper (padded to 32)
State: 3 EE position + 1 gripper (padded to 32)

Training Progress (Epoch 1, interrupted)

Loss: 40.5 → 6.4 (16,600/18,077 steps, ~92% of epoch 1)
Time: ~91 minutes on A100-40GB
Peak GPU: 29.6GB

Dataset

Dataset on Google Drive (pi0fast_dataset.hdf5, 6GB):

361 episodes from 2 cleaning videos
224x224 images + smoothed Franka joint trajectories
Language instructions from VLM labeling

Resume Training

pip install lerobot h5py peft scipy
python3 train_pi0fast.py

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support