Add files using upload-large-folder tool
Browse files- .gitattributes +2 -0
- README.md +26 -14
- assets/tdmpc2-g1-walk.mp4 +3 -0
- assets/tdmpc2-h1-walk.mp4 +3 -0
.gitattributes
CHANGED
|
@@ -35,3 +35,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
TDMPC2+HBench-h1-walk-v0+0/preview.mp4 filter=lfs diff=lfs merge=lfs -text
|
| 37 |
TDMPC2+HBench-g1-walk-v0+0/preview.mp4 filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
|
|
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
TDMPC2+HBench-h1-walk-v0+0/preview.mp4 filter=lfs diff=lfs merge=lfs -text
|
| 37 |
TDMPC2+HBench-g1-walk-v0+0/preview.mp4 filter=lfs diff=lfs merge=lfs -text
|
| 38 |
+
assets/tdmpc2-g1-walk.mp4 filter=lfs diff=lfs merge=lfs -text
|
| 39 |
+
assets/tdmpc2-h1-walk.mp4 filter=lfs diff=lfs merge=lfs -text
|
README.md
CHANGED
|
@@ -39,12 +39,22 @@ _TD-MPC2 is a model-based RL algorithm combining a world model with sample-based
|
|
| 39 |
|
| 40 |
`success_bar = 700` (HumanoidBench locomotion threshold). _Success = episode return ≥ success_bar._
|
| 41 |
|
| 42 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 43 |
|
| 44 |
-
|
| 45 |
|
| 46 |
-
|
| 47 |
-
|
| 48 |
|
| 49 |
### 对比同任务 DR.Q 自训 ([wsagi/HumanoidBench-DR.Q](https://huggingface.co/wsagi/HumanoidBench-DR.Q))
|
| 50 |
|
|
@@ -112,16 +122,18 @@ DISPLAY=:0 conda run -n humanoidbench python scripts/tdmpc2_viewer.py \
|
|
| 112 |
## 📁 仓库结构 / Repo layout
|
| 113 |
|
| 114 |
```
|
| 115 |
-
|
| 116 |
-
├──
|
| 117 |
-
├──
|
| 118 |
-
├──
|
| 119 |
-
└──
|
| 120 |
-
|
| 121 |
-
|
| 122 |
-
├──
|
| 123 |
-
|
| 124 |
-
└──
|
|
|
|
|
|
|
| 125 |
```
|
| 126 |
|
| 127 |
`+0` 表示 seed=0。后续如果发其他 seed 会按 `+10` / `+20` 命名。
|
|
|
|
| 39 |
|
| 40 |
`success_bar = 700` (HumanoidBench locomotion threshold). _Success = episode return ≥ success_bar._
|
| 41 |
|
| 42 |
+
---
|
| 43 |
+
|
| 44 |
+
## 🎬 演示 / Demos
|
| 45 |
+
|
| 46 |
+
### H1-walk-v0 (Unitree H1, 19 DoF)
|
| 47 |
+
|
| 48 |
+
<video controls width="720" src="https://huggingface.co/wsagi/HumanoidBench-TD-MPC2/resolve/main/assets/tdmpc2-h1-walk.mp4"></video>
|
| 49 |
+
|
| 50 |
+
完整 walking cycle,1000 步不倒。_Full walking cycle, runs through 1000 steps without falling._
|
| 51 |
+
|
| 52 |
+
### G1-walk-v0 (Unitree G1, 23 DoF with PD + BlockedHands wrappers)
|
| 53 |
|
| 54 |
+
<video controls width="720" src="https://huggingface.co/wsagi/HumanoidBench-TD-MPC2/resolve/main/assets/tdmpc2-g1-walk.mp4"></video>
|
| 55 |
|
| 56 |
+
37D action 屏蔽 14D 手指剩 23D,配 PD 位置控制;含偶发踉跄但 50% 集 ≥ success_bar。
|
| 57 |
+
_With 14D fingers masked (37→23 act dim) + PD position control; occasional stumbles but 50% of eps clear the success bar._
|
| 58 |
|
| 59 |
### 对比同任务 DR.Q 自训 ([wsagi/HumanoidBench-DR.Q](https://huggingface.co/wsagi/HumanoidBench-DR.Q))
|
| 60 |
|
|
|
|
| 122 |
## 📁 仓库结构 / Repo layout
|
| 123 |
|
| 124 |
```
|
| 125 |
+
HumanoidBench-TD-MPC2/
|
| 126 |
+
├── README.md (this file)
|
| 127 |
+
├── assets/
|
| 128 |
+
│ ├── tdmpc2-h1-walk.mp4 (515 KB — H1-walk GUI recording)
|
| 129 |
+
│ └── tdmpc2-g1-walk.mp4 (257 KB — G1-walk GUI recording)
|
| 130 |
+
├── TDMPC2+HBench-h1-walk-v0+0/
|
| 131 |
+
│ ├── step_950000.pt (32 MB — agent + world model + critic)
|
| 132 |
+
│ ├── train.log (~370 KB — full training log)
|
| 133 |
+
│ └── ckpt_eval.csv (auto-eval per ckpt, N=3 quick)
|
| 134 |
+
└── TDMPC2+HBench-g1-walk-v0+0/
|
| 135 |
+
├── step_950000.pt (32 MB)
|
| 136 |
+
└── train.log (~700 KB)
|
| 137 |
```
|
| 138 |
|
| 139 |
`+0` 表示 seed=0。后续如果发其他 seed 会按 `+10` / `+20` 命名。
|
assets/tdmpc2-g1-walk.mp4
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:47b47c47d802d399552e6506d5faf781e8fab36395df8406b97704eef04b90bf
|
| 3 |
+
size 256928
|
assets/tdmpc2-h1-walk.mp4
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2bad368d1a3dad93006d1d0a56f9ead19fbbd8e9f2ec62f1b199ca96e1b5c70f
|
| 3 |
+
size 515283
|