pravsels commited on
Commit
5433b7f
·
verified ·
1 Parent(s): 0c03ca1

Add model card

Browse files
Files changed (1) hide show
  1. README.md +61 -0
README.md ADDED
@@ -0,0 +1,61 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ tags:
4
+ - robotics
5
+ - pi0
6
+ - bin-packing
7
+ - openpi
8
+ ---
9
+
10
+ # pi0.5 Bin Pack — Single Dataset Baseline
11
+
12
+ Fine-tuned [pi0.5](https://github.com/Physical-Intelligence/openpi) checkpoint for coffee capsule bin packing, trained on a single dataset of ~200 teleoperated episodes. This serves as the base checkpoint for the reward recap experiments.
13
+
14
+ ## Config
15
+
16
+ - **Config name:** `pi05_bin_pack_coffee_capsules_delta_single_dataset`
17
+ - **Model:** pi0.5 (`pi05=True`, `action_horizon=50`)
18
+ - **Batch size:** 36
19
+ - **Learning rate:** 5e-5 cosine decay (10k warmup)
20
+ - **Optimizer:** AdamW (gradient clip norm 1.0)
21
+ - **EMA decay:** 0.999
22
+ - **Delta actions:** enabled
23
+ - **Weight init:** `weights/pi05_base/params` (pi0.5 base weights)
24
+ - **Training steps:** 30,000
25
+
26
+ ## Dataset
27
+
28
+ - `villekuosmanen/bin_pick_pack_coffee_capsules` (~200 teleoperated episodes)
29
+
30
+ ## Checkpoint Hash
31
+
32
+ Verify integrity with `tar cf - -C checkpoints/29999 params | sha256sum`.
33
+
34
+ | Step | SHA-256 |
35
+ |------|---------|
36
+ | 29,999 | `bb051b5a3ee10adae7ee5313102fd7157e49d77a12a3b9a48e0688617108f9b0` |
37
+
38
+ ## Downstream
39
+
40
+ This checkpoint is the weight init for the reward recap experiments:
41
+
42
+ - [pi05-bin-pack-reward-recap-positive-only](https://huggingface.co/pravsels/pi05-bin-pack-reward-recap-positive-only)
43
+ - [pi05-bin-pack-reward-recap-mixed](https://huggingface.co/pravsels/pi05-bin-pack-reward-recap-mixed)
44
+
45
+ ## Repo Structure
46
+
47
+ ```
48
+ assets/ # Norm stats + valid indices for inference
49
+ checkpoints/29999/params/ # Model weights (params only)
50
+ README.md # This file
51
+ ```
52
+
53
+ ## Usage
54
+
55
+ ```python
56
+ from openpi.training.config import get_config
57
+ from openpi.serving.policy_server import PolicyServer
58
+
59
+ config = get_config("pi05_bin_pack_coffee_capsules_delta_single_dataset")
60
+ server = PolicyServer(config, checkpoint_path="checkpoints/29999/params")
61
+ ```