wintermelontree
/

robomimic-pretrain-checkpoints

Model card Files Files and versions

Add model card for DICE-RL

#1

by nielsr HF Staff - opened 28 days ago

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

Files changed (1) hide show

README.md +33 -0

README.md ADDED Viewed

	@@ -0,0 +1,33 @@

+---
+pipeline_tag: robotics
+---
+# From Prior to Pro: Efficient Skill Mastery via Distribution Contractive RL Finetuning (DICE-RL)
+This repository contains the checkpoints for **DICE-RL**, a framework that uses reinforcement learning (RL) as a "distribution contraction" operator to refine pretrained generative robot policies.
+[**Project Website**](https://zhanyisun.github.io/dice.rl.2026/) | [**Paper**](https://huggingface.co/papers/2603.10263) | [**GitHub**](https://github.com/zhanyisun/dice-rl)
+## Introduction
+Distribution Contractive Reinforcement Learning (DICE-RL) turns a pretrained behavior prior into a high-performing "pro" policy by amplifying high-success behaviors from online feedback. The framework pretrains a diffusion- or flow-based policy for broad behavioral coverage, then finetunes it with a stable, sample-efficient residual off-policy RL framework that combines selective behavior regularization with value-guided action selection. It enables mastery of complex long-horizon manipulation skills directly from high-dimensional pixel inputs.
+## Evaluation
+To evaluate the finetuned RL checkpoints and pretrained BC checkpoints and to get success rates for both, use the following command from the [official repository](https://github.com/zhanyisun/dice-rl):
+```bash
+python script/eval_rl_checkpoint.py --ckpt_path path_to_finetuned_checkpoint --num_eval_episodes 10 --eval_n_envs 10
+```
+The output will include the success rates for both the finetuned RL checkpoint and the pretrained BC checkpoint, as well as the gain of the finetuned RL checkpoint over the pretrained BC checkpoint.
+## Citation
+If you find this work or the checkpoints useful, please consider citing:
+```bibtex
+@article{sun2026prior,
+  title={From Prior to Pro: Efficient Skill Mastery via Distribution Contractive RL Finetuning},
+  author={Sun, Zhanyi and Song, Shuran},
+  journal={arXiv preprint arXiv:2603.10263},
+  year={2026}
+}
+```