PI0.5 Fine-tuned on Droyd UMI Data

Fine-tuned PI0.5 on bimanual manipulation demonstrations collected via UMI (Universal Manipulation Interface) gloves.

Training Details

Parameter Value
Base model lerobot/pi05_base (4.1B params)
Dataset 110 approved episodes, 11,765 frames @ 50fps
Session 158cc246-6594-410a-8d5a-db58a4101276
Steps 5,000
Batch size 32 effective (4 per GPU × 8 GPUs)
Hardware 8× NVIDIA H200 (Vast.ai)
Training time ~62 minutes
Final loss ~0.002
Optimizer AdamW (lr=2.5e-5, weight_decay=0.01)
LR schedule Cosine decay with 166-step warmup
Normalization Quantile (state & action)
Action space 7D absolute joint positions (6 joints + gripper)
Cameras ego (overhead) + wrist
Review clips Enabled (user-trimmed episode boundaries)

Input Features

Feature Shape Type
observation.images.cam_highbase_0_rgb (3, 224, 224) VISUAL
observation.images.cam_wristleft_wrist_0_rgb (3, 224, 224) VISUAL
observation.state (32,) STATE (quantile normalized)

Output Features

Feature Shape Type
action (7,) ACTION (quantile normalized)

W&B Run

droyd/lerobot/rmr4moga

Usage

from lerobot.common.policies.pi05.modeling_pi05 import PI05Policy

policy = PI05Policy.from_pretrained("rayhanfahmed/pi05_droyd_umi")
Downloads last month
124
Video Preview
loading

Model tree for rayhanfahmed/pi05_droyd_umi

Finetuned
(7)
this model