	---
	library_name: lerobot
	base_model: lerobot/pi05_base
	datasets:
	- CoRL2026-CSI/SO101-Teleop-Open_drawer_100epi
	tags:
	- lerobot
	- robotics
	- robot-learning
	- imitation-learning
	- behavior-cloning
	- visuomotor-policy
	- vision-language-action
	- pi05
	- pi0.5
	- openpi
	- paligemma
	- so101
	- teleoperation
	- open-drawer
	- pytorch
	---

	# Pi0.5 Teleop Open Drawer

	This repository contains a LeRobot Pi0.5 policy fine-tuned for the SO101
	`open_drawer` teleoperation task.

	The checkpoint was fine-tuned from `lerobot/pi05_base` on
	`CoRL2026-CSI/SO101-Teleop-Open_drawer_100epi` and saved after the final
	training step.

	## Model Details

	- Policy type: `pi05`
	- Base model: `lerobot/pi05_base`
	- Training dataset: `CoRL2026-CSI/SO101-Teleop-Open_drawer_100epi`
	- Task: open a drawer with SO101 teleoperation demonstrations
	- Checkpoint: final checkpoint at step `2200`
	- Action dimension: `6`
	- State dimension: `32`
	- Image resolution: `224 x 224`
	- Precision: `bfloat16`
	- Training framework: LeRobot

	## Input and Output Features

	The policy checkpoint is configured with the following observation features:

	- `observation.images.base_0_rgb`: visual input, shape `[3, 224, 224]`
	- `observation.images.left_wrist_0_rgb`: visual input, shape `[3, 224, 224]`
	- `observation.images.right_wrist_0_rgb`: visual input, shape `[3, 224, 224]`
	- `observation.state`: robot state, shape `[32]`

	The output feature is:

	- `action`: robot action, shape `[6]`
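
As a quick sanity check before feeding observations to the policy, the shapes listed above can be written down as a table and compared against what a data pipeline actually produces. The following is a minimal sketch in plain Python. `EXPECTED_SHAPES` and `find_shape_mismatches` are hypothetical names introduced for illustration and are not part of LeRobot; shapes are per sample, without a batch dimension.

```python
# Per-sample feature shapes copied from the lists above.
EXPECTED_SHAPES = {
    "observation.images.base_0_rgb": (3, 224, 224),
    "observation.images.left_wrist_0_rgb": (3, 224, 224),
    "observation.images.right_wrist_0_rgb": (3, 224, 224),
    "observation.state": (32,),
    "action": (6,),
}


def find_shape_mismatches(shapes):
    """Return {key: (expected, actual)} for every missing or mismatched key.

    `shapes` maps feature names to per-sample shapes; a missing key is
    reported with an actual shape of None.
    """
    mismatches = {}
    for key, expected in EXPECTED_SHAPES.items():
        actual = shapes.get(key)
        actual = tuple(actual) if actual is not None else None
        if actual != expected:
            mismatches[key] = (expected, actual)
    return mismatches


# Hypothetical example: a 6-dimensional state where the policy expects 32.
example = dict(EXPECTED_SHAPES)
example["observation.state"] = (6,)
print(find_shape_mismatches(example))
```

An empty result means every listed feature is present with the expected shape.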

	The saved policy preprocessor renames dataset camera keys to the policy's
	input keys as follows (no mapping is listed for
	`observation.images.right_wrist_0_rgb`):

	- `observation.images.top` -> `observation.images.base_0_rgb`
	- `observation.images.left_wrist` -> `observation.images.left_wrist_0_rgb`
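
The effect of this key mapping can be illustrated with a plain dictionary rename. This is only a sketch of the idea, not LeRobot's implementation: the saved preprocessor applies the mapping itself, and `CAMERA_KEY_MAP` and `rename_camera_keys` are hypothetical names used here for illustration.

```python
# Dataset camera key -> policy input key, as listed above.
CAMERA_KEY_MAP = {
    "observation.images.top": "observation.images.base_0_rgb",
    "observation.images.left_wrist": "observation.images.left_wrist_0_rgb",
}


def rename_camera_keys(observation):
    """Return a copy of `observation` with dataset camera keys renamed.

    Keys without an entry in CAMERA_KEY_MAP are passed through unchanged.
    """
    return {CAMERA_KEY_MAP.get(key, key): value for key, value in observation.items()}


# Placeholder strings stand in for image tensors and the state vector.
frame = {
    "observation.images.top": "<top image>",
    "observation.images.left_wrist": "<left wrist image>",
    "observation.state": "<state vector>",
}
print(sorted(rename_camera_keys(frame)))
```

Non-camera entries such as `observation.state` keep their original key.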

	## Training

	Training used the following main settings:

	- Steps: `2200`
	- Batch size: `32`
	- Gradient accumulation: `4`
	- Optimizer: AdamW
	- Learning rate: `2.5e-5`
	- Scheduler: cosine decay with warmup
	- Image augmentation: enabled
	- Final training loss: `0.0379133597`
	- Final train steps logged: `2200`
	- Final train samples logged: `140800`
	- Final train epochs logged: `6.2694808086`
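
The logged totals above can be cross-checked with a little arithmetic. The sketch below uses only the numbers listed in this section. The two relations it relies on, that logged samples equal steps times samples per step and that logged epochs equal samples divided by dataset frames, are assumptions about how the counters were computed.

```python
# Values copied from the training settings above.
steps = 2200
batch_size = 32
samples_logged = 140_800
epochs_logged = 6.2694808086

# Samples consumed per logged step.
samples_per_step = samples_logged / steps

# How many configured batches one logged step corresponds to.
batches_per_step = samples_per_step / batch_size

# Dataset size implied by: samples = epochs * frames.
implied_frames = samples_logged / epochs_logged

print(f"samples per step: {samples_per_step:.0f}")
print(f"batches per step: {batches_per_step:.0f}")
print(f"implied dataset frames: {implied_frames:.0f}")
```

Under these assumptions each logged step covers 64 samples, twice the listed batch size of 32, and the dataset contains about 22,458 frames. This card does not say where the factor of two comes from; it would be consistent with, for example, two data-parallel processes, but that is a guess.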

	No separate evaluation results are included in this repository.

	## Usage

	Use the model as a LeRobot policy by pointing `--policy.path` at this Hub repo:

	```bash
	lerobot-record \
	--robot.type=<your_robot_type> \
	--dataset.repo_id=<your_eval_dataset_repo> \
	--policy.path=CoRL2026-CSI/pi05_teleop_open_drawer \
	--dataset.num_episodes=10
	```

	You can also load it directly in Python:

	```python
	from lerobot.policies.pi05.modeling_pi05 import PI05Policy

	policy = PI05Policy.from_pretrained("CoRL2026-CSI/pi05_teleop_open_drawer")
	policy.eval()
	```

	## Files

	- `model.safetensors`: policy weights
	- `config.json`: Pi0.5 policy configuration
	- `train_config.json`: training configuration
	- `policy_preprocessor.json`: saved policy input processor pipeline
	- `policy_postprocessor.json`: saved policy output processor pipeline
	- `*_processor.safetensors`: normalization and unnormalization state
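
To inspect the configured features without loading the weights, `config.json` can be read with the standard library. The sketch below assumes the file stores features under `input_features` and `output_features`, each entry carrying a `shape` list. That layout, the inline `SAMPLE_CONFIG` excerpt, and the helper `summarize_features` are assumptions made for illustration; check them against the actual file before relying on them.

```python
import json

# Hypothetical excerpt in the assumed layout; the real config.json
# contains many more fields and all three camera inputs.
SAMPLE_CONFIG = """
{
  "type": "pi05",
  "input_features": {
    "observation.images.base_0_rgb": {"type": "VISUAL", "shape": [3, 224, 224]},
    "observation.state": {"type": "STATE", "shape": [32]}
  },
  "output_features": {
    "action": {"type": "ACTION", "shape": [6]}
  }
}
"""


def summarize_features(config):
    """Return sorted (group, name, shape) tuples for input and output features."""
    rows = []
    for group in ("input_features", "output_features"):
        for name, spec in config.get(group, {}).items():
            rows.append((group, name, tuple(spec["shape"])))
    return sorted(rows)


for group, name, shape in summarize_features(json.loads(SAMPLE_CONFIG)):
    print(f"{group:16s} {name:32s} {shape}")
```

To run it on the real file, replace `json.loads(SAMPLE_CONFIG)` with the result of `json.load` on a downloaded copy of `config.json`.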