Create README.md
Browse files
README.md
ADDED
|
@@ -0,0 +1,32 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
datasets:
|
| 4 |
+
- lerobot/pusht_image
|
| 5 |
+
---
|
| 6 |
+
|
| 7 |
+
# Model Card for Mini Diffusion Policy / PushT
|
| 8 |
+
|
| 9 |
+
We add few lines to add an extra level-2 minibatch for Diffusion Policy (as per [Mini Diffuser (ICRA 2026)](https://arxiv.org/abs/2505.09430)) trained for the `PushT` environment from [gym-pusht](https://github.com/huggingface/gym-pusht). This enables tens of equivalent batch size per gradient step, and end up saving at least 60% of the training time to obtain similar training results.
|
| 10 |
+
|
| 11 |
+
## How to Get Started with the Model
|
| 12 |
+
|
| 13 |
+
See the [LeRobot library](https://github.com/huggingface/lerobot) for instructions on how to load and evaluate this model.
|
| 14 |
+
|
| 15 |
+
## Training Details
|
| 16 |
+
|
| 17 |
+
Trained with a forked [LeRobot@
|
| 18 |
+
7bd533a](https://github.com/utomm/lerobot/tree/minidp-0.4.2).
|
| 19 |
+
|
| 20 |
+
The model was trained using [LeRobot's training script](https://github.com/huggingface/lerobot/blob/main/lerobot/scripts/train.py) and with the [pusht](https://huggingface.co/datasets/lerobot/pusht) dataset, using this command:
|
| 21 |
+
|
| 22 |
+
```bash
|
| 23 |
+
lerobot-train --policy.type=minidiffusion --dataset.repo_id=lerobot/pusht_image\
|
| 24 |
+
--env.type=pusht --seed=100000 --batch_size=32 --log_freq=200 --wandb.disable_artifact=true\
|
| 25 |
+
--steps=100000 --eval_freq=10000 --save_freq=10000 --wandb.enable=true --policy.repo_id=id\
|
| 26 |
+
--wandb.project=minidp --policy.push_to_hub=false --policy.level2_batch_size=8 --job_name=minidp-32-8
|
| 27 |
+
```
|
| 28 |
+
|
| 29 |
+
|
| 30 |
+
The training curves, and the comparasions with original DP may be found at https://api.wandb.ai/links/hu2240877635/defcr4wu
|
| 31 |
+
<iframe src="https://wandb.ai/hu2240877635/minidp/reports/Accelerating-Diffusion-Policy-Training-with-MiniDP--VmlldzoxNjI1NzQ0OA" style="border:none;height:1024px;width:100%">
|
| 32 |
+
The current model corresponds to the checkpoint at 90k steps.
|