bryandong
/

pushf-diffusion-policy

diffusion-policy

imitation-learning

behavioral-cloning

Model card Files Files and versions

pushf-diffusion-policy / README.md

bryandong's picture

Upload README.md with huggingface_hub

9f99dae verified 14 days ago

|

history blame contribute delete

2.47 kB

	---
	license: mit
	tags:
	- diffusion-policy
	- imitation-learning
	- behavioral-cloning
	- robotics
	- push-task
	library_name: pytorch
	pipeline_tag: robotics
	---

	# Push-F Diffusion Policy

	A visuomotor diffusion policy trained to push an F-shaped block into a target orientation, adapted from the [Diffusion Policy](https://diffusion-policy.cs.columbia.edu/) codebase (Chi et al., 2023).

	## Model Description

	- Architecture: Diffusion UNet with ResNet18 image encoder
	- Parameters: 278M
	- Observations: 96x96 RGB image + 2D agent position
	- Actions: 2D target position for the agent
	- Training data: 101 human demonstrations (~29,800 timesteps)
	- Training: 250 epochs on NVIDIA H100, ~3.5 hours
	- Framework: PyTorch 2.0.1

	## Performance

	Evaluated on 50 held-out environment seeds:

	\| Time Limit \| Mean Score \| Perfect Seeds (1.0) \|
	\|:----------:\|:----------:\|:-------------------:\|
	\| 30s \| 0.837 \| 19/50 \|
	\| 45s \| 0.945 \| 38/50 \|
	\| 60s \| 0.961 \| 45/50 \|
	\| 90s \| 1.000 \| 50/50 \|

	## Usage

	```bash
	git clone https://github.com/bryandong24/reu_adaptation.git
	cd reu_adaptation

	# Set up environment
	mamba env create -f conda_environment.yaml
	conda activate robodiff
	pip install torch==2.0.1+cu118 torchvision==0.15.2+cu118 --extra-index-url https://download.pytorch.org/whl/cu118
	pip install -e .

	# Download checkpoint and evaluate
	python eval.py --checkpoint epoch=0250-test_mean_score=0.880.ckpt -o eval_output
	```

	## Training Details

	- Loss: MSE denoising loss (DDPM)
	- Optimizer: AdamW (lr=1e-4, weight_decay=1e-6)
	- LR Schedule: Cosine with 500-step warmup
	- Batch size: 64
	- Horizon: 16 steps (n_obs=2, n_action=8)
	- Diffusion steps: 100 (training), 100 (inference)
	- EMA: Enabled

	## Citation

	Based on:
	```bibtex
	@inproceedings{chi2023diffusionpolicy,
	title={Diffusion Policy: Visuomotor Policy Learning via Action Diffusion},
	author={Chi, Cheng and Feng, Siyuan and Du, Yilun and Xu, Zhenjia and Cousineau, Eric and Burchfiel, Benjamin and Song, Shuran},
	booktitle={Proceedings of Robotics: Science and Systems (RSS)},
	year={2023}
	}
	```

	## Links

	- [Full Report](https://github.com/bryandong24/reu_adaptation/blob/main/REPORT.md)
	- [Source Code](https://github.com/bryandong24/reu_adaptation)
	- [W&B Training Logs](https://wandb.ai/bryandong24-stanford-university/diffusion_policy_pushf)