liangsu9988 commited on
Commit
145d662
·
verified ·
1 Parent(s): dd7ec46

Update model card - remove author/date info

Browse files
Files changed (1) hide show
  1. README.md +36 -108
README.md CHANGED
@@ -11,135 +11,63 @@ tags:
11
  - tensorrt
12
  datasets:
13
  - libero
14
- metrics:
15
- - accuracy
16
  pipeline_tag: robotics
17
  library_name: pytorch
18
  ---
19
 
20
- # Turbo-Pi0.5: High-Performance VLA Model on NVIDIA Jetson Thor
21
 
22
- **14.7x Speedup** | **20.6 Hz Inference** | **98% LIBERO Spatial Accuracy**
23
 
24
- Turbo-Pi0.5 is an optimized implementation of the Pi0.5 Vision-Language-Action (VLA) model for NVIDIA Jetson Thor platform, achieving real-time robot control at 20.6 Hz.
25
-
26
- ## Model Description
27
-
28
- This model is based on the [Physical Intelligence Pi0](https://www.physicalintelligence.company/blog/pi0) architecture, optimized for edge deployment on NVIDIA Jetson Thor with the following techniques:
29
-
30
- - **KV Cache**: Reuse prefix K,V across denoising steps
31
- - **TensorRT Export**: ONNX to TensorRT FP16 engines
32
- - **Reduced Denoising Steps**: 10 to 3 steps with minimal accuracy loss
33
- - **Dual Stream Pipeline**: Vision-Action parallel execution
34
-
35
- ## Performance
36
-
37
- | Metric | Baseline | Turbo-Pi0.5 | Improvement |
38
- |--------|----------|-------------|-------------|
39
- | **Throughput** | 1.4 Hz | **20.6 Hz** | **14.7x** |
40
- | **Latency** | 714 ms | **48.5 ms** | **14.7x** |
41
- | **LIBERO Spatial** | - | **98%** | - |
42
-
43
- ### TensorRT Performance (NVIDIA Jetson Thor)
44
-
45
- | Denoising Steps | Latency | Throughput |
46
- |-----------------|---------|------------|
47
- | 10 steps | 131.7 ms | 7.6 Hz |
48
- | 5 steps | 72.3 ms | 13.8 Hz |
49
- | **3 steps** | **48.5 ms** | **20.6 Hz** |
50
- | 2 steps | 36.7 ms | 27.3 Hz |
51
-
52
- ## LIBERO Benchmark Results
53
-
54
- ### libero_spatial (10 tasks, 10 trials each)
55
-
56
- | Task | Success Rate |
57
- |------|--------------|
58
- | pick_up_black_bowl_between_plate_and_ramekin | 10/10 (100%) |
59
- | pick_up_black_bowl_next_to_ramekin | 10/10 (100%) |
60
- | pick_up_black_bowl_from_table_center | 10/10 (100%) |
61
- | pick_up_black_bowl_next_to_cookie_box | 10/10 (100%) |
62
- | pick_up_black_bowl_in_top_drawer | 10/10 (100%) |
63
- | pick_up_black_bowl_on_ramekin | 8/10 (80%) |
64
- | pick_up_black_bowl_on_cookie_box | 10/10 (100%) |
65
- | pick_up_black_bowl_on_stove | 10/10 (100%) |
66
- | pick_up_black_bowl_next_to_plate | 10/10 (100%) |
67
- | pick_up_black_bowl_on_wooden_cabinet | 10/10 (100%) |
68
- | **Total** | **98/100 (98%)** |
69
 
70
  ## Usage
71
 
72
- ### Installation
73
-
74
  ```bash
75
- # Clone repository
76
- git clone https://github.com/LiangSu8899/TurboPi.git
77
- cd TurboPi/openpi
78
-
79
- # Install dependencies
80
- pip install -e .
81
-
82
  # Download model
83
- huggingface-cli download liangsu9988/Turbo-Pi0.5 --local-dir ~/.cache/openpi/checkpoints/pi05_libero
84
- ```
85
 
86
- ### Inference
87
-
88
- ```bash
89
- # Start policy server
90
- python scripts/serve_policy.py \
91
- --env=LIBERO \
92
- --port=8000 \
93
- policy:checkpoint \
94
- --policy.config=pi05_libero \
95
- --policy.dir=~/.cache/openpi/checkpoints/pi05_libero
96
  ```
97
 
98
- ### Python API
99
 
100
- ```python
101
- from openpi_client import WebsocketClientPolicy
 
102
 
103
- # Connect to policy server
104
- policy = WebsocketClientPolicy(host="localhost", port=8000)
105
 
106
- # Get action from observation
107
- action = policy.get_action({
108
- "images": {"cam_high": image, "cam_left_wrist": left_img, "cam_right_wrist": right_img},
109
- "state": robot_state,
110
- "prompt": "pick up the black bowl"
111
- })
112
- ```
113
 
114
- ## Model Architecture
115
 
116
- - **Vision Encoder**: SigLIP-SO400M (400M parameters)
117
- - **Language Model**: Gemma 2B
118
- - **Action Expert**: Gemma 300M
119
- - **Total Parameters**: ~2.7B
120
-
121
- ## Hardware Requirements
122
-
123
- - NVIDIA Jetson Thor (recommended) or GPU with 8GB+ VRAM
124
- - JetPack 7.1+ for Jetson
125
- - CUDA 12.0+
126
-
127
- ## Citation
128
-
129
- ```bibtex
130
- @misc{turbopi05,
131
- title={Turbo-Pi0.5: High-Performance VLA Model on NVIDIA Jetson Thor},
132
- author={Liang Su},
133
- year={2026},
134
- url={https://github.com/LiangSu8899/TurboPi}
135
- }
136
- ```
137
 
138
- ## Acknowledgments
139
 
140
- - [Physical Intelligence](https://www.physicalintelligence.company/) for the original Pi0 model
141
- - [OpenPi](https://github.com/Physical-Intelligence/openpi) for the open-source implementation
142
- - NVIDIA for Jetson Thor platform and TensorRT tools
143
 
144
  ## License
145
 
 
11
  - tensorrt
12
  datasets:
13
  - libero
 
 
14
  pipeline_tag: robotics
15
  library_name: pytorch
16
  ---
17
 
18
+ # Turbo-Pi0.5
19
 
20
+ **14.7x faster Pi0.5 VLA model optimized for NVIDIA Jetson Thor**
21
 
22
+ | Metric | Before | After |
23
+ |--------|--------|-------|
24
+ | Inference Speed | 1.4 Hz | **20.6 Hz** |
25
+ | Latency | 714 ms | **48.5 ms** |
26
+ | LIBERO Spatial | - | **98%** |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
27
 
28
  ## Usage
29
 
 
 
30
  ```bash
 
 
 
 
 
 
 
31
  # Download model
32
+ huggingface-cli download liangsu9988/Turbo-Pi0.5 \
33
+ --local-dir ~/.cache/openpi/checkpoints/pi05_libero
34
 
35
+ # Run inference server
36
+ python scripts/serve_policy.py --env=LIBERO --port=8000 \
37
+ policy:checkpoint --policy.config=pi05_libero
 
 
 
 
 
 
 
38
  ```
39
 
40
+ ## Architecture
41
 
42
+ - **Vision**: SigLIP-SO400M
43
+ - **Language**: Gemma 2B
44
+ - **Action**: Gemma 300M Expert
45
 
46
+ ## Optimizations
 
47
 
48
+ - KV Cache for efficient denoising
49
+ - TensorRT FP16 acceleration
50
+ - Reduced denoising steps (10 3)
 
 
 
 
51
 
52
+ ## LIBERO Spatial Results (98%)
53
 
54
+ | Task | Success |
55
+ |------|---------|
56
+ | pick_up_black_bowl_between_plate_and_ramekin | 100% |
57
+ | pick_up_black_bowl_next_to_ramekin | 100% |
58
+ | pick_up_black_bowl_from_table_center | 100% |
59
+ | pick_up_black_bowl_next_to_cookie_box | 100% |
60
+ | pick_up_black_bowl_in_top_drawer | 100% |
61
+ | pick_up_black_bowl_on_ramekin | 80% |
62
+ | pick_up_black_bowl_on_cookie_box | 100% |
63
+ | pick_up_black_bowl_on_stove | 100% |
64
+ | pick_up_black_bowl_next_to_plate | 100% |
65
+ | pick_up_black_bowl_on_wooden_cabinet | 100% |
 
 
 
 
 
 
 
 
 
66
 
67
+ ## Links
68
 
69
+ - **Code**: [GitHub](https://github.com/LiangSu8899/TurboPi)
70
+ - **Base**: [OpenPi](https://github.com/Physical-Intelligence/openpi)
 
71
 
72
  ## License
73