bryandong
/

pushf-diffusion-policy

+---
+license: mit
+tags:
+- diffusion-policy
+- imitation-learning
+- behavioral-cloning
+- robotics
+- push-task
+library_name: pytorch
+pipeline_tag: robotics
+---
+# Push-F Diffusion Policy
+A visuomotor diffusion policy trained to push an F-shaped block into a target orientation, adapted from the [Diffusion Policy](https://diffusion-policy.cs.columbia.edu/) codebase (Chi et al., 2023).
+## Model Description
+- **Architecture:** Diffusion UNet with ResNet18 image encoder
+- **Parameters:** 278M
+- **Observations:** 96x96 RGB image + 2D agent position
+- **Actions:** 2D target position for the agent
+- **Training data:** 101 human demonstrations (~29,800 timesteps)
+- **Training:** 250 epochs on NVIDIA H100, ~3.5 hours
+- **Framework:** PyTorch 2.0.1
+## Performance
+Evaluated on 50 held-out environment seeds:
+| Time Limit | Mean Score | Perfect Seeds (1.0) |
+|:----------:|:----------:|:-------------------:|
+| 30s        | 0.837      | 19/50               |
+| 45s        | 0.945      | 38/50               |
+| 60s        | 0.961      | 45/50               |
+| 90s        | 1.000      | 50/50               |
+## Usage
+```bash
+git clone https://github.com/bryandong24/reu_adaptation.git
+cd reu_adaptation
+# Set up environment
+mamba env create -f conda_environment.yaml
+conda activate robodiff
+pip install torch==2.0.1+cu118 torchvision==0.15.2+cu118 --extra-index-url https://download.pytorch.org/whl/cu118
+pip install -e .
+# Download checkpoint and evaluate
+python eval.py --checkpoint epoch=0250-test_mean_score=0.880.ckpt -o eval_output
+```
+## Training Details
+- **Loss:** MSE denoising loss (DDPM)
+- **Optimizer:** AdamW (lr=1e-4, weight_decay=1e-6)
+- **LR Schedule:** Cosine with 500-step warmup
+- **Batch size:** 64
+- **Horizon:** 16 steps (n_obs=2, n_action=8)
+- **Diffusion steps:** 100 (training), 100 (inference)
+- **EMA:** Enabled
+## Citation
+Based on:
+```bibtex
+@inproceedings{chi2023diffusionpolicy,
+    title={Diffusion Policy: Visuomotor Policy Learning via Action Diffusion},
+    author={Chi, Cheng and Feng, Siyuan and Du, Yilun and Xu, Zhenjia and Cousineau, Eric and Burchfiel, Benjamin and Song, Shuran},
+    booktitle={Proceedings of Robotics: Science and Systems (RSS)},
+    year={2023}
+}
+```
+## Links
+- [Full Report](https://github.com/bryandong24/reu_adaptation/blob/main/REPORT.md)
+- [Source Code](https://github.com/bryandong24/reu_adaptation)
+- [W&B Training Logs](https://wandb.ai/bryandong24-stanford-university/diffusion_policy_pushf)