lerobot
/

diffusion_pusht

diffusion-policy

model_hub_mixin

pytorch_model_hub_mixin

Model card Files Files and versions

alexandersoare commited on May 6, 2024

Commit

49dda6b

·

verified ·

1 Parent(s): 37628ec

Upload folder using huggingface_hub

Files changed (1) hide show

README.md +7 -4

README.md CHANGED Viewed

@@ -1,3 +1,8 @@
 # Model Card for Diffusion Policy / PushT
 Diffusion Policy (as per [Diffusion Policy: Visuomotor Policy
@@ -11,10 +16,6 @@ See the [LeRobot library](https://github.com/huggingface/lerobot) (particularly
 ## Training Details
-TODO commit hash.
-Trained with [LeRobot@d747195](https://github.com/huggingface/lerobot/tree/d747195c5733c4f68d4bfbe62632d6fc1b605712).
 The model was trained using [LeRobot's training script](https://github.com/huggingface/lerobot/blob/d747195c5733c4f68d4bfbe62632d6fc1b605712/lerobot/scripts/train.py) and with the [pusht](https://huggingface.co/datasets/lerobot/pusht/tree/v1.3) dataset.
 Here are the [loss](./train_loss.csv), [evaluation score](./eval_avg_max_reward.csv), [evaluation success rate](./eval_pc_success.csv) (with 50 rollouts) during training.
@@ -23,6 +24,8 @@ Here are the [loss](./train_loss.csv), [evaluation score](./eval_avg_max_reward.
 This took about 7 hours to train on an Nvida RTX 3090.
 ## Evaluation
 The model was evaluated on the `PushT` environment from [gym-pusht](https://github.com/huggingface/gym-pusht) and compared to a similar model trained with the original [Diffusion Policy code](https://github.com/real-stanford/diffusion_policy). There are two evaluation metrics on a per-episode basis:

+---
+license: apache-2.0
+datasets:
+- lerobot/pusht
+---
 # Model Card for Diffusion Policy / PushT
 Diffusion Policy (as per [Diffusion Policy: Visuomotor Policy
 ## Training Details
 The model was trained using [LeRobot's training script](https://github.com/huggingface/lerobot/blob/d747195c5733c4f68d4bfbe62632d6fc1b605712/lerobot/scripts/train.py) and with the [pusht](https://huggingface.co/datasets/lerobot/pusht/tree/v1.3) dataset.
 Here are the [loss](./train_loss.csv), [evaluation score](./eval_avg_max_reward.csv), [evaluation success rate](./eval_pc_success.csv) (with 50 rollouts) during training.
 This took about 7 hours to train on an Nvida RTX 3090.
+_Note: At the time of training, [this PR](https://github.com/huggingface/lerobot/pull/129) was also incorporated._
 ## Evaluation
 The model was evaluated on the `PushT` environment from [gym-pusht](https://github.com/huggingface/gym-pusht) and compared to a similar model trained with the original [Diffusion Policy code](https://github.com/real-stanford/diffusion_policy). There are two evaluation metrics on a per-episode basis: