pravsels commited on
Commit
19f91e5
·
verified ·
1 Parent(s): 94df945

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +72 -0
README.md ADDED
@@ -0,0 +1,72 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ tags:
4
+ - robotics
5
+ - pi0
6
+ - arx5
7
+ - multitask
8
+ - openpi
9
+ ---
10
+
11
+ # pi0.5 ARX5 Multitask Micro Baseline
12
+
13
+ Fine-tuned [pi0.5](https://github.com/Physical-Intelligence/openpi) checkpoint for multi-task manipulation with ARX5 arms, trained on a 14-dataset micro mix with valid-index filtering (human-controlled + successful episodes only).
14
+
15
+ ## Experiment
16
+
17
+ - **Objective:** Fine-tune PI0.5 on the micro training mix with baseline valid indices; compare later to advantaged variant.
18
+ - **Weight init:** `weights/pi05_base/params` (pi0.5 base weights).
19
+ - **Total steps:** 30,000 (completed)
20
+ - **Final loss:** 0.0107 (step 29,900)
21
+
22
+ ## Config
23
+
24
+ - **Config name:** `pi05_arx5_multitask_micro_baseline`
25
+ - **Model:** pi0.5 (`pi05=True`, `action_horizon=50`)
26
+ - **Batch size:** 36
27
+ - **Learning rate:** 5e-5 cosine decay (1k warmup, decay over 100k steps)
28
+ - **Optimizer:** AdamW (gradient clip norm 1.0)
29
+ - **EMA decay:** 0.999
30
+ - **Delta actions:** enabled (delta joints, absolute grippers)
31
+ - **Per-timestep action normalization:** enabled (auto from delta actions)
32
+ - **Action space:** 14D bimanual (single-arm 7D padded to 14D with loss masking)
33
+
34
+ ## Dataset
35
+
36
+ 14 LeRobot datasets from `training_mix_micro.json` (all `villekuosmanen/*` repos). Filtered by `valid_indices.txt` to include only human-controlled, successful episodes.
37
+
38
+ ## Checkpoint Hashes
39
+
40
+ Verify integrity with:
41
+
42
+ ```bash
43
+ cd checkpoints/<step> && find params -type f | sort | xargs sha256sum | sha256sum
44
+ ```
45
+
46
+ | Step | Loss | SHA-256 |
47
+ |------|------|---------|
48
+ | 25,000 | 0.0119 | `69ee51b80032d3a4424bd3834167fdd4d839701ab3b267c73ae6b7386922f1f8` |
49
+ | 29,999 | 0.0107 | `450e1c86c1d95ccb7215cc3662b90c6b56fb483006b640dfa2bc70bfa2593c01` |
50
+
51
+ ## W&B
52
+
53
+ - [Training dashboard](https://wandb.ai/pravsels/arx5_multitask/runs/gtk5f6zw)
54
+
55
+ ## Repo Structure
56
+
57
+ ```
58
+ assets/ # Norm stats, valid_indices.txt, training_mix_micro.json
59
+ checkpoints/<step>/params/ # Model weights (params only)
60
+ README.md # This file
61
+ TRAINING_LOG.md # Training log
62
+ ```
63
+
64
+ ## Usage
65
+
66
+ ```python
67
+ from openpi.training.config import get_config
68
+ from openpi.serving.policy_server import PolicyServer
69
+
70
+ config = get_config("pi05_arx5_multitask_micro_baseline")
71
+ server = PolicyServer(config, checkpoint_path="checkpoints/<step>/params")
72
+ ```