Kavin60606
/

pi0fast-dk1-lora-v1

Model card Files Files and versions

pi0-FAST DK1 LoRA Fine-tune v1

Fine-tuned pi0-FAST on egocentric manipulation data for the TRLC DK1 Follower robot arm.

Training Details

Parameter	Value
Base model	pi0-FAST (PaliGemma 2B LoRA)
Fine-tune method	LoRA (gemma_2b_lora)
Steps	1000
Batch size	32
Action dim	7 (6 arm joints + 1 gripper)
Action horizon	10
Max token length	350
delta joint actions	True

Dataset

Field	Value
HF Dataset	kavinrajkr60/dk1_egodex
Source	EgoDex egocentric manipulation episodes
Episodes	1800 (enriched ego part1 + part2)
Annotations	L1-L5 language annotations
FPS	10
Robot	TRLC DK1 Follower (6-DOF + parallel jaw gripper)

Robot: TRLC DK1 Follower

6-DOF revolute arm + 2 prismatic gripper joints
End effector: tool0
IK solver: PINK (Pinocchio, QP-based differential IK)
Action space: 7-dim [joint1..joint6, gripper_open]

Loss Curve

Step 0: loss=12.56
Step 300-360: loss=3.21-3.38
Step 1000: training complete (~55 min, single GPU)

Loss dropped from 12.56 to ~3.2 over 1000 steps.

W&B Run

https://wandb.ai/kavinrajkr60-dsfsd/roboinc/runs/22za2p1i

Known Issues

Token truncation: some samples exceed max_token_len=350 (up to 414 tokens). Increase in next run.

Downloads last month: -; Downloads are not tracked for this model. How to track

Video Preview

loading