ReGuide β Checkpoints
Pretrained diffusion policy and visual dynamics model checkpoints for ReGuide: From Test-Time Guidance to Self-Improving Diffusion Policies (Lin, Shakkottai, Kalathil, Kumar β Texas A&M University).
ReGuide is a self-improving framework that recycles guided rollouts as on-policy recovery data to fix covariate shift in behavior-cloned diffusion policies, improving base-policy success by 1.3β7.7Γ on Robomimic. See the project page and paper for method details.
Checkpoints are organized by benchmark:
robomimic/β Can, Square, Transport, Tool Hang (available now)libero/β coming soon
Robomimic tasks
| Task | Demos | Notes |
|---|---|---|
can |
15 | policy + dynamics model |
square |
30 | policy + dynamics model |
transport |
10 | policy + dynamics model |
tool_hang |
80 | policy + dynamics model |
Repository layout
robomimic/diffusion_policy/<task>/<variant>/
βββ checkpoints/<epoch>.ckpt # policy weights (~4.3β5.3 GB)
βββ normalizer.pth # observation/action normalizer
βββ .hydra/ # full training config
robomimic/dyn_model/<task>/
βββ checkpoints/model_<epoch>.pth # visual dynamics model (~0.5β1.5 GB)
βββ normalizer.pth
βββ hydra.yaml # training config
The number in each checkpoint filename (e.g. 880.ckpt) is the training epoch.
Diffusion policy variants
| Variant | What it is |
|---|---|
base_policy |
Diffusion policy trained on the original demonstrations only. |
ReGuide-FS |
Retrained from scratch on demos + guided rollouts. |
ReGuide-FT-iteration1 |
Base checkpoint fine-tuned on demos + guided rollouts (first iteration). |
ReGuide-FT-iteration2 |
Second fine-tuning iteration on freshly collected guided rollouts. |
ReGuide-FS-FT |
ReGuide-FT applied on top of a ReGuide-FS policy (best on Can/Square/Transport). |
Each checkpoint ships with its full Hydra training config (.hydra/ for policies,
hydra.yaml for dynamics models), which records the exact hyperparameters used. Paths
inside the configs are relative.
Code
Training and evaluation code: https://github.com/tzuhsiangl/reguide
Citation
@article{lin2026reguide,
title = {ReGuide: From Test-Time Guidance to Self-Improving Diffusion Policies},
author = {Lin, Tzu-Hsiang and Shakkottai, Srinivas and Kalathil, Dileep and Kumar, P. R.},
year = {2026},
url = {https://reguide-project.github.io}
}
License
Released under the MIT License.