VLABench
/

pi05-primitive-10task

Model card Files Files and versions

CyberDJ commited on Nov 11, 2025

Commit

f658a8b

·

verified ·

1 Parent(s): 8e3df8b

Update README.md

Files changed (1) hide show

README.md +23 -1

README.md CHANGED Viewed

@@ -1,9 +1,31 @@
 ---
 license: apache-2.0
 ---
 This repository provides the official release of the Pi 0.5 model with the stop-gradient mechanism enabled, trained with the whole VLABench's official primitive tasks dataset.
-This checkpoint is trained on 8 H100 for 30k iterations, with 5000 episodes data acrossing 10 tasks.
 The reference success rate of this model is:
 | Track                      | add_condiment | insert_flower | select_book | select_chemistry_tube | select_drink | select_fruit | select_mahjong | select_painting | select_poker | select_toy | Avg_SR |
 |----------------------------|---------------|---------------|-----------------|-----------------|--------------------|--------------------|-----------------|-----------------|-------------------|-------------------|--------|

 ---
 license: apache-2.0
 ---
+# Pi05 official implementation trained on VLABench datasets.
 This repository provides the official release of the Pi 0.5 model with the stop-gradient mechanism enabled, trained with the whole VLABench's official primitive tasks dataset.
+# Evaluation
+To run this checkpoint, please clone this repo: https://github.com/Shiduo-zh/openpi, and checkout to the branch `pi05`.
+Assume that you download this checkpoints and put it in the directory `checkpoints`, to run the policy as server, please run:
+```sh
+bash vla_bench_scipts/serve_policy.sh pi05_ft_vlabench_primitive checkpoints/VLABench/pi05-primitive-10task/sg_pi05_base/29999/
+```
+After serving the policy, open another terminal and run:
+```sh
+bash vla_bench_scipts/multi_run_vlabench.sh <Your path to store the evaluate results>
+```
+# Train
+To reproduce the training result, please run the training script with the config `pi05_ft_vlabench_primitive`.
+```sh
+XLA_PYTHON_CLIENT_MEM_FRACTION=0.95 uv run scripts/train.py pi05_ft_vlabench_primitive --exp-name=pi05_ft_vlabench_primitive --overwrite
+```
+Our checkpoint is trained on 8 H100 for 30k iterations, with 5000 episodes data acrossing 10 tasks.
+# Reference Results
 The reference success rate of this model is:
 | Track                      | add_condiment | insert_flower | select_book | select_chemistry_tube | select_drink | select_fruit | select_mahjong | select_painting | select_poker | select_toy | Avg_SR |
 |----------------------------|---------------|---------------|-----------------|-----------------|--------------------|--------------------|-----------------|-----------------|-------------------|-------------------|--------|