wsagi commited on 15 days ago

Commit

fc41b60

verified ·

1 Parent(s): 795d1db

Add files using upload-large-folder tool

Browse files

Files changed (19) hide show

README.md +149 -0
config.json +107 -0
model-00001-of-00011.safetensors +3 -0
model-00002-of-00011.safetensors +3 -0
model-00003-of-00011.safetensors +3 -0
model-00004-of-00011.safetensors +3 -0
model-00005-of-00011.safetensors +3 -0
model-00006-of-00011.safetensors +3 -0
model-00007-of-00011.safetensors +3 -0
model-00008-of-00011.safetensors +3 -0
model-00009-of-00011.safetensors +3 -0
model-00010-of-00011.safetensors +3 -0
model-00011-of-00011.safetensors +3 -0
model.safetensors.index.json +0 -0
policy_postprocessor.json +32 -0
policy_postprocessor_step_0_unnormalizer_processor.safetensors +3 -0
policy_preprocessor.json +90 -0
policy_preprocessor_step_2_normalizer_processor.safetensors +3 -0
train_config.json +248 -0

README.md ADDED Viewed

	@@ -0,0 +1,149 @@

+---
+license: apache-2.0
+library_name: lerobot
+pipeline_tag: robotics
+tags:
+  - pi05
+  - openpi
+  - lerobot
+  - so101
+  - leisaac
+  - pick-orange
+  - isaac-sim
+  - flow-matching
+  - vla
+  - negative-result
+datasets:
+  - LightwheelAI/leisaac-pick-orange
+language:
+  - en
+---
+# Pi0.5-PickOrange — π0.5 PyTorch expert-only FT (⚠️ negative result)
+**⚠️ 这是一个有据可查的失败实验（已公开作为反面教材 / educational negative result）**：
+20-round strict benchmark = **1/60 oranges (1.7%)**，在 [STRICT_LEADERBOARD](https://github.com/vitorcen/isaaclab-experience/blob/main/scripts/benchmark/STRICT_LEADERBOARD.md) 上末位，**比同任务的 SmolVLA 低 15 倍**。发布的目的是把"为什么 π0.5 在 LeIsaac PickOrange 上学不会"这件事用 ckpt 本身固定下来，供后续研究者复现 / 否证。
+_This is a **deliberately published failure** — a documented negative result. 20-round strict eval = 1/60 oranges (1.7%), last place on the strict leaderboard, **15× worse than SmolVLA** on the same task. Published to anchor the "why π0.5 doesn't learn this task" claim with a real checkpoint, so others can reproduce / refute._
+**🔗 项目仓库 / Project repos**：
+- [vitorcen/isaaclab-experience](https://github.com/vitorcen/isaaclab-experience) — Isaac Lab + LeIsaac 多策略横评（parent project）
+- [vitorcen/LeIsaac-Training](https://github.com/vitorcen/LeIsaac-Training) — LeIsaac fork（训练脚本 + 设计文档 / training scripts + design docs）
+- 完整 negative report HTML: [`pi05_pytorch_expert_ft_negative.html`](https://github.com/vitorcen/LeIsaac-Training/blob/main/docs/training/pi05_pytorch_expert_ft_negative.html)
+## TL;DR
+| Item | Value |
+|------|-------|
+| **任务 / Task** | SO-101 PickOrange — 单臂依次夹起 3 颗橙子放盘子 |
+| **数据集 / Dataset** | [`LightwheelAI/leisaac-pick-orange`](https://huggingface.co/datasets/LightwheelAI/leisaac-pick-orange) (60 demos, 30Hz) |
+| **架构 / Architecture** | π0.5 = PaliGemma-2B VLM (frozen) + Gemma-300M action expert (trainable) + flow-matching |
+| **可训参数 / Trainable params** | 693M (gemma_expert layers 425M + lm_head 263M + norm 3M) |
+| **配方 / Recipe** | `train_expert_only=true`, `freeze_vision_encoder=true`, bf16, lr=2.5e-5, chunk=50, batch=1 + grad_accum=8, 10k steps |
+| **vision input** | **SigLIP @ 224×224**（PaliGemma 硬编码，**主嫌**） |
+| **Strict benchmark** | **1/60 oranges (1.7%)** — 20 rounds × 3 ep × 1 orange/ep, ckpt-2000 |
+| **σ(5-round)** | 0.50 / 15 (3.3%) — worst-case (μ-1σ) = **-0.25 / 15** |
+| **Leaderboard 排名 / Rank** | **6/6（末位）**，低 SmolVLA 15× |
+| **Inference latency** | ~108 ms / chunk (50-step flow matching, RTX 4090) |
+| **GPU hours** | ~3.5 h on RTX Pro 6000 (bf16, ZeRO-2 offload) |
+## 为什么发布失败模型 / Why publish a failed model
+科研里负面结果通常被丢进抽屉，但其实和成功一样有价值：
+1. **锁定假设**：让后续研究者可以 load 这个 ckpt 直接验证"是不是这套配方在这个数据集上真的不行"，避免反复踩同样的坑。
+2. **隔离变量**：训练侧的 dataloader / preprocessor / postprocessor / camera mapping / freeze 配置都已经调通（基础设施 4 个 bug 修完），失败不是 infra 噪声，而是**架构 vs 任务**的真实信号。
+3. **量化"偶尔的 1 只"**：用户最初看到 3-round 跑出 2/9 觉得有希望，但 20-round 1/60 证明那只是 Bernoulli outlier (p≈1.7%)。
+_Negative results matter as much as positive ones. This ckpt lets others verify the failure mode without re-spending the GPU hours._
+## 根因分析（主嫌 80%）/ Root cause (main suspect, 80% confidence)
+**PaliGemma-2B 的 SigLIP vision encoder 硬编码 224×224 输入**，而 LeIsaac 原生 640×480 → 2.86× downscale 后橙子只剩 **10–17 px**，**≤1 个 SigLIP patch (14px)**。
+对比同任务上 work 的模型：
+| Model | Vision encoder | Input res | Orange size after resize | Result |
+|-------|---------------|-----------|--------------------------|--------|
+| GR00T-N1.7 | Eagle-2 ViT | 448 | 22-34 px (1.5–2.4 patch) | 68.3% ✅ |
+| SmolVLA | SigLIP | 512 | 24-40 px (1.7–2.9 patch) | 25.0% ✅ |
+| **π0.5 (this)** | **SigLIP** | **224** | **10-17 px (≤1 patch)** | **1.7% ❌** |
+→ 橙子在 vision token 上几乎不可见，"freeze 整个 PaliGemma + 只训 action expert"再多 token 也无法补救 vision bottleneck。
+_PaliGemma's SigLIP is hardcoded to 224×224 — after downscaling LeIsaac's native 640×480, oranges shrink to ≤1 SigLIP patch. No amount of expert-only training can recover information already lost at the vision encoder._
+## 训练配方 / Training recipe
+```bash
+# 训练入口 / training entry
+bash LeIsaac/scripts/training/pi05_pt/train.sh
+# 关键 flags / key flags
+--policy.train_expert_only=true       # freeze PaliGemma, train only gemma_expert
+--policy.freeze_vision_encoder=true   # explicit redundant lock
+--policy.gradient_checkpointing=true  # 24GB VRAM under bf16
+--policy.dtype=bfloat16
+--policy.chunk_size=50
+--policy.n_action_steps=50
+--policy.max_state_dim=32
+--policy.max_action_dim=32
+--policy.optimizer_lr=2.5e-5
+--steps=10000  --save_freq=1000  --batch_size=1
+```
+Camera rename (LeIsaac 2-cam → π0.5 3-cam, missing `left_wrist` auto-padded inside modeling_pi05.py:1195):
+```python
+rename_map = {
+    "observation.images.front":  "observation.images.base_0_rgb",
+    "observation.images.wrist":  "observation.images.right_wrist_0_rgb",
+}
+```
+## 复现 / Reproduce
+```python
+from lerobot.policies.pi05 import PI05Policy
+policy = PI05Policy.from_pretrained("wsagi/Pi0.5-PickOrange")
+# 然后接 LeIsaac Isaac Sim eval pipeline
+# Then plug into the LeIsaac Isaac Sim eval pipeline:
+#   scripts/benchmark/run_one_strict.sh
+```
+20-round strict benchmark（distribution, 20 rounds × 3 episodes）：
+| P(placed=0) | P(placed=1) | P(placed=2) | P(placed=3) | E(🍊)/ep |
+|-------------|-------------|-------------|-------------|----------|
+| **95% (57/60)** | **5% (3/60)** | 0% | 0% | **0.05** |
+19/20 rounds 全 0/3，1 round 出现 1/3（Episode 8: placed=[F, T, F]）。Bernoulli noise distribution，无 task-completion signal。
+## 已 sweep 过的 ckpt / Checkpoints evaluated
+10k 训练每 1k 存一个，13 个 ckpt（500/1k/1.5k/.../10k）全 3-round 横评 = **1/60 oranges across 13 ckpts**，**全部 0/9 或 1/9**，无单调收敛迹象。ckpt-2000 是 3-round 抓到 2/9 的那个（最高），20-round 跑下来回归到 1/60，证实是 noise outlier 不是 signal。
+## 何时该用 / 不该用 / When (not) to use
+❌ **不要在生产环境使用** — 1.7% success rate 没有 task-completion 价值
+✅ **可以用作**：
+- π0.5 在低分辨率 VLM bottleneck 任务上的 baseline reference
+- "freeze VLM + train expert only" 配方失败案例的复现 ckpt
+- LeIsaac eval pipeline 的 π0.5 wire 协议验证 fixture
+## 替代方案 / Alternatives (better on same task)
+| Model | Strict | Where |
+|-------|--------|-------|
+| 🥇 GR00T-N1.7 (self-trained) | 68.3% | [`wsagi/GR00T-N1.6-PickOrange`](https://huggingface.co/wsagi) |
+| 🥈 SmolVLA (self-trained) | 25.0% | wsagi (待发布 / pending) |
+| 🥉 Diffusion Policy DDIM | 概率性 3/3 | [`wsagi/DiffusionPolicy-PickOrange`](https://huggingface.co/wsagi/DiffusionPolicy-PickOrange) |
+## License & Attribution
+- Apache-2.0
+- Base model: `lerobot/pi05_base` (Physical Intelligence × LeRobot)
+- Dataset: [`LightwheelAI/leisaac-pick-orange`](https://huggingface.co/datasets/LightwheelAI/leisaac-pick-orange)
+- Trained on RTX Pro 6000 96GB
+- Evaluated in Isaac Sim 5.1 + LeIsaac

config.json ADDED Viewed

	@@ -0,0 +1,107 @@

+{
+    "type": "pi05",
+    "n_obs_steps": 1,
+    "input_features": {
+        "observation.images.base_0_rgb": {
+            "type": "VISUAL",
+            "shape": [
+                3,
+                224,
+                224
+            ]
+        },
+        "observation.images.left_wrist_0_rgb": {
+            "type": "VISUAL",
+            "shape": [
+                3,
+                224,
+                224
+            ]
+        },
+        "observation.images.right_wrist_0_rgb": {
+            "type": "VISUAL",
+            "shape": [
+                3,
+                224,
+                224
+            ]
+        },
+        "observation.state": {
+            "type": "STATE",
+            "shape": [
+                32
+            ]
+        }
+    },
+    "output_features": {
+        "action": {
+            "type": "ACTION",
+            "shape": [
+                6
+            ]
+        }
+    },
+    "device": "cuda",
+    "use_amp": false,
+    "use_peft": false,
+    "push_to_hub": false,
+    "repo_id": null,
+    "private": null,
+    "tags": null,
+    "license": null,
+    "pretrained_path": "lerobot/pi05_base",
+    "paligemma_variant": "gemma_2b",
+    "action_expert_variant": "gemma_300m",
+    "dtype": "bfloat16",
+    "chunk_size": 50,
+    "n_action_steps": 50,
+    "max_state_dim": 32,
+    "max_action_dim": 32,
+    "num_inference_steps": 10,
+    "time_sampling_beta_alpha": 1.5,
+    "time_sampling_beta_beta": 1.0,
+    "time_sampling_scale": 0.999,
+    "time_sampling_offset": 0.001,
+    "min_period": 0.004,
+    "max_period": 4.0,
+    "use_relative_actions": false,
+    "relative_exclude_joints": [
+        "gripper"
+    ],
+    "action_feature_names": [
+        "shoulder_pan.pos",
+        "shoulder_lift.pos",
+        "elbow_flex.pos",
+        "wrist_flex.pos",
+        "wrist_roll.pos",
+        "gripper.pos"
+    ],
+    "rtc_config": null,
+    "image_resolution": [
+        224,
+        224
+    ],
+    "empty_cameras": 0,
+    "tokenizer_max_length": 200,
+    "normalization_mapping": {
+        "VISUAL": "IDENTITY",
+        "STATE": "QUANTILES",
+        "ACTION": "QUANTILES"
+    },
+    "gradient_checkpointing": true,
+    "compile_model": false,
+    "compile_mode": "max-autotune",
+    "freeze_vision_encoder": true,
+    "train_expert_only": true,
+    "optimizer_lr": 2.5e-05,
+    "optimizer_betas": [
+        0.9,
+        0.95
+    ],
+    "optimizer_eps": 1e-08,
+    "optimizer_weight_decay": 0.01,
+    "optimizer_grad_clip_norm": 1.0,
+    "scheduler_warmup_steps": 1000,
+    "scheduler_decay_steps": 30000,
+    "scheduler_decay_lr": 2.5e-06
+}

model-00001-of-00011.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:ca835f0e50fcf350dc518ea1dbd04ac6609744c4aaf079abc8e97d6e4a72bc7c
+size 898280512

model-00002-of-00011.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e662f131ce8545c87b1ed74b0af238725ee50bca4529a7ee73513e79fdeec483
+size 1053294760

model-00003-of-00011.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:8f9ba4311a7118eec21bac703085a38c91f36771758c59d188a320020b2a48ab
+size 1053294784

model-00004-of-00011.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:b702accf389b55f313872ee796d522cf9630086042712102250b63a8bd4b91ff
+size 851767984

model-00005-of-00011.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6d207be560b6bd5078bf0714a7e71952b31b9e894cf259e4d356b4f7e9a584c7
+size 880875344

model-00006-of-00011.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6bfc8b874024ff4c38c5862b611cfd1572d809a132f29ef78653e6d1601ecd72
+size 880875360

model-00007-of-00011.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1439d7dd98da0154639b9fd5245c183784a31281ad76d4961b1181da7d7142e6
+size 880875336

model-00008-of-00011.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:dc3418e4b52fb237dd8b065a8e3f89559a6d969113104aaa941c87e310097921
+size 880875320

model-00009-of-00011.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:385d990ea155f324c1dc98946ec27dfcfc28c386fadc5baa8101601f83d8458b
+size 888081800

model-00010-of-00011.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:011cf1287caf19d5402e7e69c52e18cb603e644d26cd7990dd41ee749fae2ed9
+size 894575464

model-00011-of-00011.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:2f3f34536e9854636f2b100a3dd1cd09cd390ececce0f9d24e5f379f356f19b0
+size 191252832

model.safetensors.index.json ADDED Viewed

The diff for this file is too large to render. See raw diff

policy_postprocessor.json ADDED Viewed

	@@ -0,0 +1,32 @@

+{
+  "name": "policy_postprocessor",
+  "steps": [
+    {
+      "registry_name": "unnormalizer_processor",
+      "config": {
+        "eps": 1e-08,
+        "features": {
+          "action": {
+            "type": "ACTION",
+            "shape": [
+              6
+            ]
+          }
+        },
+        "norm_map": {
+          "VISUAL": "IDENTITY",
+          "STATE": "QUANTILES",
+          "ACTION": "QUANTILES"
+        }
+      },
+      "state_file": "policy_postprocessor_step_0_unnormalizer_processor.safetensors"
+    },
+    {
+      "registry_name": "device_processor",
+      "config": {
+        "device": "cpu",
+        "float_dtype": null
+      }
+    }
+  ]
+}

policy_postprocessor_step_0_unnormalizer_processor.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5ac4af145fa293fb9282322bee7c87eb369ba8aca3e09dbf1db7600f46142fd5
+size 7552

policy_preprocessor.json ADDED Viewed

	@@ -0,0 +1,90 @@

+{
+  "name": "policy_preprocessor",
+  "steps": [
+    {
+      "registry_name": "rename_observations_processor",
+      "config": {
+        "rename_map": {
+          "observation.images.front": "observation.images.base_0_rgb",
+          "observation.images.wrist": "observation.images.right_wrist_0_rgb"
+        }
+      }
+    },
+    {
+      "registry_name": "to_batch_processor",
+      "config": {}
+    },
+    {
+      "registry_name": "normalizer_processor",
+      "config": {
+        "eps": 1e-08,
+        "features": {
+          "observation.images.base_0_rgb": {
+            "type": "VISUAL",
+            "shape": [
+              3,
+              224,
+              224
+            ]
+          },
+          "observation.images.left_wrist_0_rgb": {
+            "type": "VISUAL",
+            "shape": [
+              3,
+              224,
+              224
+            ]
+          },
+          "observation.images.right_wrist_0_rgb": {
+            "type": "VISUAL",
+            "shape": [
+              3,
+              224,
+              224
+            ]
+          },
+          "observation.state": {
+            "type": "STATE",
+            "shape": [
+              32
+            ]
+          },
+          "action": {
+            "type": "ACTION",
+            "shape": [
+              6
+            ]
+          }
+        },
+        "norm_map": {
+          "VISUAL": "IDENTITY",
+          "STATE": "QUANTILES",
+          "ACTION": "QUANTILES"
+        }
+      },
+      "state_file": "policy_preprocessor_step_2_normalizer_processor.safetensors"
+    },
+    {
+      "registry_name": "pi05_prepare_state_tokenizer_processor_step",
+      "config": {}
+    },
+    {
+      "registry_name": "tokenizer_processor",
+      "config": {
+        "max_length": 200,
+        "task_key": "task",
+        "padding_side": "right",
+        "padding": "max_length",
+        "truncation": true,
+        "tokenizer_name": "google/paligemma-3b-pt-224"
+      }
+    },
+    {
+      "registry_name": "device_processor",
+      "config": {
+        "device": "cuda",
+        "float_dtype": null
+      }
+    }
+  ]
+}

policy_preprocessor_step_2_normalizer_processor.safetensors ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5ac4af145fa293fb9282322bee7c87eb369ba8aca3e09dbf1db7600f46142fd5
+size 7552

train_config.json ADDED Viewed

	@@ -0,0 +1,248 @@

+{
+    "dataset": {
+        "repo_id": "LightwheelAI/leisaac-pick-orange",
+        "root": "/home/david/work/isaaclab-experience/LeIsaac/datasets/raw/leisaac-pick-orange",
+        "episodes": null,
+        "image_transforms": {
+            "enable": false,
+            "max_num_transforms": 3,
+            "random_order": false,
+            "tfs": {
+                "brightness": {
+                    "weight": 1.0,
+                    "type": "ColorJitter",
+                    "kwargs": {
+                        "brightness": [
+                            0.8,
+                            1.2
+                        ]
+                    }
+                },
+                "contrast": {
+                    "weight": 1.0,
+                    "type": "ColorJitter",
+                    "kwargs": {
+                        "contrast": [
+                            0.8,
+                            1.2
+                        ]
+                    }
+                },
+                "saturation": {
+                    "weight": 1.0,
+                    "type": "ColorJitter",
+                    "kwargs": {
+                        "saturation": [
+                            0.5,
+                            1.5
+                        ]
+                    }
+                },
+                "hue": {
+                    "weight": 1.0,
+                    "type": "ColorJitter",
+                    "kwargs": {
+                        "hue": [
+                            -0.05,
+                            0.05
+                        ]
+                    }
+                },
+                "sharpness": {
+                    "weight": 1.0,
+                    "type": "SharpnessJitter",
+                    "kwargs": {
+                        "sharpness": [
+                            0.5,
+                            1.5
+                        ]
+                    }
+                },
+                "affine": {
+                    "weight": 1.0,
+                    "type": "RandomAffine",
+                    "kwargs": {
+                        "degrees": [
+                            -5.0,
+                            5.0
+                        ],
+                        "translate": [
+                            0.05,
+                            0.05
+                        ]
+                    }
+                }
+            }
+        },
+        "revision": null,
+        "use_imagenet_stats": true,
+        "video_backend": "torchcodec",
+        "return_uint8": false,
+        "streaming": false
+    },
+    "env": null,
+    "policy": {
+        "type": "pi05",
+        "n_obs_steps": 1,
+        "input_features": {
+            "observation.images.base_0_rgb": {
+                "type": "VISUAL",
+                "shape": [
+                    3,
+                    224,
+                    224
+                ]
+            },
+            "observation.images.left_wrist_0_rgb": {
+                "type": "VISUAL",
+                "shape": [
+                    3,
+                    224,
+                    224
+                ]
+            },
+            "observation.images.right_wrist_0_rgb": {
+                "type": "VISUAL",
+                "shape": [
+                    3,
+                    224,
+                    224
+                ]
+            },
+            "observation.state": {
+                "type": "STATE",
+                "shape": [
+                    32
+                ]
+            }
+        },
+        "output_features": {
+            "action": {
+                "type": "ACTION",
+                "shape": [
+                    6
+                ]
+            }
+        },
+        "device": "cuda",
+        "use_amp": false,
+        "use_peft": false,
+        "push_to_hub": false,
+        "repo_id": null,
+        "private": null,
+        "tags": null,
+        "license": null,
+        "pretrained_path": "lerobot/pi05_base",
+        "paligemma_variant": "gemma_2b",
+        "action_expert_variant": "gemma_300m",
+        "dtype": "bfloat16",
+        "chunk_size": 50,
+        "n_action_steps": 50,
+        "max_state_dim": 32,
+        "max_action_dim": 32,
+        "num_inference_steps": 10,
+        "time_sampling_beta_alpha": 1.5,
+        "time_sampling_beta_beta": 1.0,
+        "time_sampling_scale": 0.999,
+        "time_sampling_offset": 0.001,
+        "min_period": 0.004,
+        "max_period": 4.0,
+        "use_relative_actions": false,
+        "relative_exclude_joints": [
+            "gripper"
+        ],
+        "action_feature_names": [
+            "shoulder_pan.pos",
+            "shoulder_lift.pos",
+            "elbow_flex.pos",
+            "wrist_flex.pos",
+            "wrist_roll.pos",
+            "gripper.pos"
+        ],
+        "rtc_config": null,
+        "image_resolution": [
+            224,
+            224
+        ],
+        "empty_cameras": 0,
+        "tokenizer_max_length": 200,
+        "normalization_mapping": {
+            "VISUAL": "IDENTITY",
+            "STATE": "QUANTILES",
+            "ACTION": "QUANTILES"
+        },
+        "gradient_checkpointing": true,
+        "compile_model": false,
+        "compile_mode": "max-autotune",
+        "freeze_vision_encoder": true,
+        "train_expert_only": true,
+        "optimizer_lr": 2.5e-05,
+        "optimizer_betas": [
+            0.9,
+            0.95
+        ],
+        "optimizer_eps": 1e-08,
+        "optimizer_weight_decay": 0.01,
+        "optimizer_grad_clip_norm": 1.0,
+        "scheduler_warmup_steps": 1000,
+        "scheduler_decay_steps": 30000,
+        "scheduler_decay_lr": 2.5e-06
+    },
+    "reward_model": null,
+    "output_dir": "/home/david/work/isaaclab-experience/LeIsaac/outputs/pi05-expert-leisaac-pick-orange",
+    "job_name": "pi05",
+    "resume": false,
+    "seed": 1000,
+    "cudnn_deterministic": false,
+    "num_workers": 4,
+    "batch_size": 1,
+    "prefetch_factor": 4,
+    "persistent_workers": true,
+    "steps": 2500,
+    "eval_freq": 20000,
+    "log_freq": 200,
+    "tolerance_s": 0.0001,
+    "save_checkpoint": true,
+    "save_freq": 500,
+    "use_policy_training_preset": true,
+    "optimizer": {
+        "type": "adamw",
+        "lr": 2.5e-05,
+        "weight_decay": 0.01,
+        "grad_clip_norm": 1.0,
+        "betas": [
+            0.9,
+            0.95
+        ],
+        "eps": 1e-08
+    },
+    "scheduler": {
+        "type": "cosine_decay_with_warmup",
+        "num_warmup_steps": 1000,
+        "num_decay_steps": 30000,
+        "peak_lr": 2.5e-05,
+        "decay_lr": 2.5e-06
+    },
+    "eval": {
+        "n_episodes": 50,
+        "batch_size": 22,
+        "use_async_envs": true
+    },
+    "wandb": {
+        "enable": false,
+        "disable_artifact": false,
+        "project": "lerobot",
+        "entity": null,
+        "notes": null,
+        "run_id": null,
+        "mode": null,
+        "add_tags": true
+    },
+    "peft": null,
+    "sample_weighting": null,
+    "rename_map": {
+        "observation.images.front": "observation.images.base_0_rgb",
+        "observation.images.wrist": "observation.images.right_wrist_0_rgb"
+    },
+    "checkpoint_path": null
+}