tony0517 commited on
Commit
b1c0e20
·
verified ·
1 Parent(s): 521a360

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +81 -0
README.md ADDED
@@ -0,0 +1,81 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ pretty_name: HoRD Models
4
+ language:
5
+ - en
6
+ tags:
7
+ - robotics
8
+ - humanoid-robot
9
+ - reinforcement-learning
10
+ - motion-imitation
11
+ - policy-learning
12
+ ---
13
+
14
+ # HoRD Models: Checkpoints for Robust Humanoid Motion Control
15
+
16
+ This model card describes the released checkpoints for **HoRD** (History-Conditioned Reinforcement Learning and Online Distillation), a two-stage framework for robust humanoid control under domain shift.
17
+
18
+ - **Paper**: [HoRD: Robust Humanoid Control via History-Conditioned Reinforcement Learning and Online Distillation](https://arxiv.org/abs/2602.04412)
19
+ - **Project Page**: [https://tonywang-0517.github.io/hord/](https://tonywang-0517.github.io/hord/)
20
+ - **Code Repository**: [https://github.com/tonywang-0517/hord](https://github.com/tonywang-0517/hord)
21
+ - **Dataset Repository**: [https://huggingface.co/datasets/tony0517/HoRD](https://huggingface.co/datasets/tony0517/HoRD)
22
+ - **Model Repository**: [https://huggingface.co/tony0517/HoRD](https://huggingface.co/tony0517/HoRD)
23
+
24
+ ## Model Overview
25
+
26
+ HoRD checkpoints are trained for robust humanoid motion imitation and control with online adaptation capability.
27
+
28
+ - **Stage 1 (Teacher RL)**: an expert policy trained with privileged observations and domain randomization.
29
+ - **Stage 2 (Online Distillation)**: a deployable student policy distilled from the teacher under partial observability and sparse commands.
30
+
31
+ These checkpoints are intended for evaluation and downstream experiments in IsaacLab and Genesis settings.
32
+
33
+ ## Model Contents
34
+
35
+ Typical release artifact:
36
+
37
+ - `your_checkpoint.ckpt`: pretrained HoRD checkpoint for evaluation or fine-tuning.
38
+
39
+ ## Quick Start
40
+
41
+ Install Hugging Face CLI:
42
+
43
+ ```bash
44
+ pip install -U "huggingface_hub[cli]"
45
+ ```
46
+
47
+ Download a checkpoint from the model repository:
48
+
49
+ ```bash
50
+ mkdir -p results
51
+ huggingface-cli download tony0517/HoRD your_checkpoint.ckpt --local-dir results --local-dir-use-symlinks False
52
+ ```
53
+
54
+ Use in HoRD evaluation:
55
+
56
+ ```bash
57
+ +checkpoint=results/your_checkpoint.ckpt
58
+ ```
59
+
60
+ ## Example Evaluation Command
61
+
62
+ ```bash
63
+ python hord/eval_agent.py +exp=full_body_tracker/transformer +robot=g1 +simulator=isaaclab +motion_file=data/train_g1_all.pt +experiment_name=full_body_tracker ++headless=False +checkpoint=results/your_checkpoint.ckpt ++num_envs=1
64
+ ```
65
+
66
+ ## License
67
+
68
+ This model card is released under the **MIT License**.
69
+
70
+ ## Citation
71
+
72
+ If you find these checkpoints useful, please cite:
73
+
74
+ ```bibtex
75
+ @article{wang2026hord,
76
+ title={HoRD: Robust Humanoid Control via History-Conditioned Reinforcement Learning and Online Distillation},
77
+ author={Wang, Puyue and Hu, Jiawei and Gao, Yan and Wang, Junyan and Zhang, Yu and Dobbie, Gillian and Gu, Tao and Johal, Wafa and Dang, Ting and Jia, Hong},
78
+ journal={arXiv preprint arXiv:2602.04412},
79
+ year={2026}
80
+ }
81
+ ```