wsagi commited on
Commit
4afe038
·
verified ·
1 Parent(s): d833f9c

Add files using upload-large-folder tool

Browse files
.gitattributes CHANGED
@@ -35,3 +35,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
  TDMPC2+HBench-h1-walk-v0+0/preview.mp4 filter=lfs diff=lfs merge=lfs -text
37
  TDMPC2+HBench-g1-walk-v0+0/preview.mp4 filter=lfs diff=lfs merge=lfs -text
 
 
 
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
  TDMPC2+HBench-h1-walk-v0+0/preview.mp4 filter=lfs diff=lfs merge=lfs -text
37
  TDMPC2+HBench-g1-walk-v0+0/preview.mp4 filter=lfs diff=lfs merge=lfs -text
38
+ assets/tdmpc2-g1-walk.mp4 filter=lfs diff=lfs merge=lfs -text
39
+ assets/tdmpc2-h1-walk.mp4 filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -39,12 +39,22 @@ _TD-MPC2 is a model-based RL algorithm combining a world model with sample-based
39
 
40
  `success_bar = 700` (HumanoidBench locomotion threshold). _Success = episode return ≥ success_bar._
41
 
42
- ### 视频预览 / Video preview
 
 
 
 
 
 
 
 
 
 
43
 
44
- 每个任务子目录下有 `preview.mp4` 演示 deterministic eval(最佳 seed, GUI viewer 录屏):
45
 
46
- - **`TDMPC2+HBench-h1-walk-v0+0/preview.mp4`** H1 humanoid 完整 walking cycle,1000 步不倒
47
- - **`TDMPC2+HBench-g1-walk-v0+0/preview.mp4`** G1 humanoid 走路,含偶发踉跄
48
 
49
  ### 对比同任务 DR.Q 自训 ([wsagi/HumanoidBench-DR.Q](https://huggingface.co/wsagi/HumanoidBench-DR.Q))
50
 
@@ -112,16 +122,18 @@ DISPLAY=:0 conda run -n humanoidbench python scripts/tdmpc2_viewer.py \
112
  ## 📁 仓库结构 / Repo layout
113
 
114
  ```
115
- TDMPC2+HBench-h1-walk-v0+0/
116
- ├── step_950000.pt (32 MB — agent + world model + critic)
117
- ├── train.log (~370 KB — full training log)
118
- ├── ckpt_eval.csv (auto-eval per ckpt, N=3 quick)
119
- └── preview.mp4 (515 KB — GUI viewer recording)
120
-
121
- TDMPC2+HBench-g1-walk-v0+0/
122
- ├── step_950000.pt (32 MB)
123
- ── train.log (~700 KB)
124
- └── preview.mp4 (257 KB)
 
 
125
  ```
126
 
127
  `+0` 表示 seed=0。后续如果发其他 seed 会按 `+10` / `+20` 命名。
 
39
 
40
  `success_bar = 700` (HumanoidBench locomotion threshold). _Success = episode return ≥ success_bar._
41
 
42
+ ---
43
+
44
+ ## 🎬 演示 / Demos
45
+
46
+ ### H1-walk-v0 (Unitree H1, 19 DoF)
47
+
48
+ <video controls width="720" src="https://huggingface.co/wsagi/HumanoidBench-TD-MPC2/resolve/main/assets/tdmpc2-h1-walk.mp4"></video>
49
+
50
+ 完整 walking cycle,1000 步不倒。_Full walking cycle, runs through 1000 steps without falling._
51
+
52
+ ### G1-walk-v0 (Unitree G1, 23 DoF with PD + BlockedHands wrappers)
53
 
54
+ <video controls width="720" src="https://huggingface.co/wsagi/HumanoidBench-TD-MPC2/resolve/main/assets/tdmpc2-g1-walk.mp4"></video>
55
 
56
+ 37D action 屏蔽 14D 手指剩 23D,配 PD 位置控制;含偶发踉跄但 50% 集 ≥ success_bar。
57
+ _With 14D fingers masked (37→23 act dim) + PD position control; occasional stumbles but 50% of eps clear the success bar._
58
 
59
  ### 对比同任务 DR.Q 自训 ([wsagi/HumanoidBench-DR.Q](https://huggingface.co/wsagi/HumanoidBench-DR.Q))
60
 
 
122
  ## 📁 仓库结构 / Repo layout
123
 
124
  ```
125
+ HumanoidBench-TD-MPC2/
126
+ ├── README.md (this file)
127
+ ├── assets/
128
+ ├── tdmpc2-h1-walk.mp4 (515 KB H1-walk GUI recording)
129
+ └── tdmpc2-g1-walk.mp4 (257 KB — G1-walk GUI recording)
130
+ ├── TDMPC2+HBench-h1-walk-v0+0/
131
+ │ ├── step_950000.pt (32 MB — agent + world model + critic)
132
+ ├── train.log (~370 KB — full training log)
133
+ │ └── ckpt_eval.csv (auto-eval per ckpt, N=3 quick)
134
+ └── TDMPC2+HBench-g1-walk-v0+0/
135
+ ├── step_950000.pt (32 MB)
136
+ └── train.log (~700 KB)
137
  ```
138
 
139
  `+0` 表示 seed=0。后续如果发其他 seed 会按 `+10` / `+20` 命名。
assets/tdmpc2-g1-walk.mp4 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:47b47c47d802d399552e6506d5faf781e8fab36395df8406b97704eef04b90bf
3
+ size 256928
assets/tdmpc2-h1-walk.mp4 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2bad368d1a3dad93006d1d0a56f9ead19fbbd8e9f2ec62f1b199ca96e1b5c70f
3
+ size 515283