iFaz commited on
Commit
7b90c61
·
verified ·
1 Parent(s): 82c81d2

Upload policy weights, train config and readme

Browse files
Files changed (3) hide show
  1. README.md +36 -64
  2. model.safetensors +1 -1
  3. train_config.json +2 -2
README.md CHANGED
@@ -1,90 +1,62 @@
1
  ---
2
- datasets:
3
- - lerobot/pusht
4
- language: en
5
  library_name: lerobot
6
  license: apache-2.0
 
 
7
  tags:
8
- - robotics
9
- - imitation-learning
10
  - diffusion
11
- - mujoco
12
- - pytorch_model_hub_mixin
13
  ---
14
 
15
- # DIFFUSION Policy diffusion_pusht_seed3
16
-
17
- Trained with [LeRobot](https://github.com/huggingface/lerobot).
18
- Date: `2026-05-28 17:14`
19
- Policy type: `diffusion` | Device: `cuda`
20
-
21
- ---
22
 
23
- ## 📦 Dataset
24
 
25
- | Parameter | Value |
26
- |---|---|
27
- | `dataset.repo_id` | `lerobot/pusht` |
28
 
29
- ---
30
 
31
- ## 🏋️ Training Config
32
 
33
- | Parameter | Value |
34
- |---|---|
35
- | `steps` | `7000` |
36
- | `batch_size` | `8` |
37
- | `eval_freq` | `0` |
38
- | `save_freq` | `2000` |
39
- | `num_workers` | `4` |
40
- | `seed` | `3` |
41
- | `eval.n_episodes` | `1` |
42
- | `eval.batch_size` | `1` |
43
- | `eval.use_async_envs` | `True` |
44
 
45
  ---
46
 
47
- ## 📐 Policy Architecture
48
 
49
- | Parameter | Value |
50
- |---|---|
51
- | `noise_scheduler_type` | `DDIM` |
52
- | `num_inference_steps` | `15` |
53
 
54
- ---
55
 
56
- ## 🎯 Eval Config
 
 
 
 
 
 
 
 
 
57
 
58
- | Parameter | Value |
59
- |---|---|
60
- | `env.type` | `pusht` |
61
- | `env.task` | `PushT-v0` |
62
- | `eval.n_episodes` | `8` |
63
- | `eval.batch_size` | `4` |
64
- | `eval.use_async_envs` | `False` |
65
- | `policy.path` | `/kaggle/working/outputs/train/pusht_seed3/checkpoints/last/pretrained_model` |
66
 
67
- ---
68
 
69
- ## 📊 Eval Results
 
 
 
 
 
 
70
 
71
- | Metric | Value |
72
- |---|---|
73
- | Episodes | `8` |
74
- | Success rate | `0.0%` |
75
- | Avg sum reward | `18.81` |
76
- | Avg max reward | `0.35` |
77
- | Eval time (s) | `52.1` |
78
 
79
  ---
80
 
81
- ## Citation
82
 
83
- ```bibtex
84
- @misc{cadene2024lerobot,
85
- author = {Cadene, Remi and Alibert, Simon and others},
86
- title = {LeRobot},
87
- year = {2024},
88
- url = {https://github.com/huggingface/lerobot}
89
- }
90
- ```
 
1
  ---
2
+ datasets: lerobot/pusht
 
 
3
  library_name: lerobot
4
  license: apache-2.0
5
+ model_name: diffusion
6
+ pipeline_tag: robotics
7
  tags:
 
 
8
  - diffusion
9
+ - lerobot
10
+ - robotics
11
  ---
12
 
13
+ # Model Card for diffusion
 
 
 
 
 
 
14
 
15
+ <!-- Provide a quick summary of what the model is/does. -->
16
 
 
 
 
17
 
18
+ [Diffusion Policy](https://huggingface.co/papers/2303.04137) treats visuomotor control as a generative diffusion process, producing smooth, multi-step action trajectories that excel at contact-rich manipulation.
19
 
 
20
 
21
+ This policy has been trained and pushed to the Hub using [LeRobot](https://github.com/huggingface/lerobot).
22
+ See the full documentation at [LeRobot Docs](https://huggingface.co/docs/lerobot/index).
 
 
 
 
 
 
 
 
 
23
 
24
  ---
25
 
26
+ ## How to Get Started with the Model
27
 
28
+ For a complete walkthrough, see the [training guide](https://huggingface.co/docs/lerobot/il_robots#train-a-policy).
29
+ Below is the short version on how to train and run inference/eval:
 
 
30
 
31
+ ### Train from scratch
32
 
33
+ ```bash
34
+ lerobot-train \
35
+ --dataset.repo_id=${HF_USER}/<dataset> \
36
+ --policy.type=act \
37
+ --output_dir=outputs/train/<desired_policy_repo_id> \
38
+ --job_name=lerobot_training \
39
+ --policy.device=cuda \
40
+ --policy.repo_id=${HF_USER}/<desired_policy_repo_id>
41
+ --wandb.enable=true
42
+ ```
43
 
44
+ _Writes checkpoints to `outputs/train/<desired_policy_repo_id>/checkpoints/`._
 
 
 
 
 
 
 
45
 
46
+ ### Evaluate the policy/run inference
47
 
48
+ ```bash
49
+ lerobot-record \
50
+ --robot.type=so100_follower \
51
+ --dataset.repo_id=<hf_user>/eval_<dataset> \
52
+ --policy.path=<hf_user>/<desired_policy_repo_id> \
53
+ --episodes=10
54
+ ```
55
 
56
+ Prefix the dataset repo with **eval\_** and supply `--policy.path` pointing to a local or hub checkpoint.
 
 
 
 
 
 
57
 
58
  ---
59
 
60
+ ## Model Details
61
 
62
+ - **License:** apache-2.0
 
 
 
 
 
 
 
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7cc4844b37b6729b16ad50530ccc1f2ed634231b5ba5151488604f1a7f624a01
3
  size 1050861448
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:af966f256674081411fdb62b1c626c8ad1de6225f26caebd3b7faf80a5bf0044
3
  size 1050861448
train_config.json CHANGED
@@ -215,12 +215,12 @@
215
  "cudnn_deterministic": false,
216
  "num_workers": 4,
217
  "batch_size": 8,
218
- "steps": 7000,
219
  "eval_freq": 0,
220
  "log_freq": 200,
221
  "tolerance_s": 0.0001,
222
  "save_checkpoint": true,
223
- "save_freq": 2000,
224
  "use_policy_training_preset": true,
225
  "optimizer": {
226
  "type": "adam",
 
215
  "cudnn_deterministic": false,
216
  "num_workers": 4,
217
  "batch_size": 8,
218
+ "steps": 70000,
219
  "eval_freq": 0,
220
  "log_freq": 200,
221
  "tolerance_s": 0.0001,
222
  "save_checkpoint": true,
223
+ "save_freq": 20000,
224
  "use_policy_training_preset": true,
225
  "optimizer": {
226
  "type": "adam",