iFaz commited on
Commit
547640d
·
verified ·
1 Parent(s): faa7f18

Upload policy weights, train config and readme

Browse files
Files changed (3) hide show
  1. README.md +36 -61
  2. model.safetensors +1 -1
  3. train_config.json +3 -3
README.md CHANGED
@@ -1,87 +1,62 @@
1
  ---
2
- datasets:
3
- - lerobot/pusht
4
- language: en
5
  library_name: lerobot
6
  license: apache-2.0
 
 
7
  tags:
8
- - robotics
9
- - imitation-learning
10
  - diffusion
11
- - mujoco
12
- - pytorch_model_hub_mixin
13
  ---
14
 
15
- # DIFFUSION Policy diffusion_pusht_seed42
16
-
17
- Trained with [LeRobot](https://github.com/huggingface/lerobot).
18
- Date: `2026-05-27 18:49`
19
- Policy type: `diffusion` | Device: `cuda`
20
-
21
- ---
22
 
23
- ## 📦 Dataset
24
 
25
- | Parameter | Value |
26
- |---|---|
27
- | `dataset.repo_id` | `lerobot/pusht` |
28
 
29
- ---
30
 
31
- ## 🏋️ Training Config
32
 
33
- | Parameter | Value |
34
- |---|---|
35
- | `steps` | `70` |
36
- | `batch_size` | `8` |
37
- | `eval_freq` | `0` |
38
- | `save_freq` | `20` |
39
- | `num_workers` | `4` |
40
- | `seed` | `42` |
41
- | `eval.n_episodes` | `1` |
42
- | `eval.batch_size` | `1` |
43
- | `eval.use_async_envs` | `True` |
44
 
45
  ---
46
 
47
- ## 📐 Policy Architecture
48
 
49
- _All defaults no overrides applied._
 
50
 
51
- ---
52
 
53
- ## 🎯 Eval Config
 
 
 
 
 
 
 
 
 
54
 
55
- | Parameter | Value |
56
- |---|---|
57
- | `env.type` | `pusht` |
58
- | `env.task` | `PushT-v0` |
59
- | `eval.n_episodes` | `8` |
60
- | `eval.batch_size` | `4` |
61
- | `eval.use_async_envs` | `False` |
62
- | `policy.path` | `/kaggle/working/outputs/train/pusht_seed42/checkpoints/last/pretrained_model` |
63
 
64
- ---
65
 
66
- ## 📊 Eval Results
 
 
 
 
 
 
67
 
68
- | Metric | Value |
69
- |---|---|
70
- | Episodes | `8` |
71
- | Success rate | `0.0%` |
72
- | Avg sum reward | `31.51` |
73
- | Avg max reward | `0.19` |
74
- | Eval time (s) | `173.4` |
75
 
76
  ---
77
 
78
- ## Citation
79
 
80
- ```bibtex
81
- @misc{cadene2024lerobot,
82
- author = {Cadene, Remi and Alibert, Simon and others},
83
- title = {LeRobot},
84
- year = {2024},
85
- url = {https://github.com/huggingface/lerobot}
86
- }
87
- ```
 
1
  ---
2
+ datasets: lerobot/pusht
 
 
3
  library_name: lerobot
4
  license: apache-2.0
5
+ model_name: diffusion
6
+ pipeline_tag: robotics
7
  tags:
 
 
8
  - diffusion
9
+ - robotics
10
+ - lerobot
11
  ---
12
 
13
+ # Model Card for diffusion
 
 
 
 
 
 
14
 
15
+ <!-- Provide a quick summary of what the model is/does. -->
16
 
 
 
 
17
 
18
+ [Diffusion Policy](https://huggingface.co/papers/2303.04137) treats visuomotor control as a generative diffusion process, producing smooth, multi-step action trajectories that excel at contact-rich manipulation.
19
 
 
20
 
21
+ This policy has been trained and pushed to the Hub using [LeRobot](https://github.com/huggingface/lerobot).
22
+ See the full documentation at [LeRobot Docs](https://huggingface.co/docs/lerobot/index).
 
 
 
 
 
 
 
 
 
23
 
24
  ---
25
 
26
+ ## How to Get Started with the Model
27
 
28
+ For a complete walkthrough, see the [training guide](https://huggingface.co/docs/lerobot/il_robots#train-a-policy).
29
+ Below is the short version on how to train and run inference/eval:
30
 
31
+ ### Train from scratch
32
 
33
+ ```bash
34
+ lerobot-train \
35
+ --dataset.repo_id=${HF_USER}/<dataset> \
36
+ --policy.type=act \
37
+ --output_dir=outputs/train/<desired_policy_repo_id> \
38
+ --job_name=lerobot_training \
39
+ --policy.device=cuda \
40
+ --policy.repo_id=${HF_USER}/<desired_policy_repo_id>
41
+ --wandb.enable=true
42
+ ```
43
 
44
+ _Writes checkpoints to `outputs/train/<desired_policy_repo_id>/checkpoints/`._
 
 
 
 
 
 
 
45
 
46
+ ### Evaluate the policy/run inference
47
 
48
+ ```bash
49
+ lerobot-record \
50
+ --robot.type=so100_follower \
51
+ --dataset.repo_id=<hf_user>/eval_<dataset> \
52
+ --policy.path=<hf_user>/<desired_policy_repo_id> \
53
+ --episodes=10
54
+ ```
55
 
56
+ Prefix the dataset repo with **eval\_** and supply `--policy.path` pointing to a local or hub checkpoint.
 
 
 
 
 
 
57
 
58
  ---
59
 
60
+ ## Model Details
61
 
62
+ - **License:** apache-2.0
 
 
 
 
 
 
 
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9e4686ff5b96f551d74de74fc63d6d1d3e589d37a05bb29ef01cb310630b0304
3
  size 1050861448
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b65d469a3f737698c6b8296325866325008b3b7c3c8ed6d44531bbcd2dabb9f2
3
  size 1050861448
train_config.json CHANGED
@@ -81,7 +81,7 @@
81
  },
82
  "env": {
83
  "type": "pusht",
84
- "task": null,
85
  "fps": 10,
86
  "features": {
87
  "action": {
@@ -215,12 +215,12 @@
215
  "cudnn_deterministic": false,
216
  "num_workers": 4,
217
  "batch_size": 8,
218
- "steps": 70,
219
  "eval_freq": 0,
220
  "log_freq": 200,
221
  "tolerance_s": 0.0001,
222
  "save_checkpoint": true,
223
- "save_freq": 20,
224
  "use_policy_training_preset": true,
225
  "optimizer": {
226
  "type": "adam",
 
81
  },
82
  "env": {
83
  "type": "pusht",
84
+ "task": "PushT-v0",
85
  "fps": 10,
86
  "features": {
87
  "action": {
 
215
  "cudnn_deterministic": false,
216
  "num_workers": 4,
217
  "batch_size": 8,
218
+ "steps": 7000,
219
  "eval_freq": 0,
220
  "log_freq": 200,
221
  "tolerance_s": 0.0001,
222
  "save_checkpoint": true,
223
+ "save_freq": 2000,
224
  "use_policy_training_preset": true,
225
  "optimizer": {
226
  "type": "adam",