HoRD Models: Checkpoints for Robust Humanoid Motion Control

This model card describes the released checkpoints for HoRD (History-Conditioned Reinforcement Learning and Online Distillation), a two-stage framework for robust humanoid control under domain shift.

Model Overview

HoRD checkpoints are trained for robust humanoid motion imitation and control with online adaptation capability.

  • Stage 1 (Teacher RL): an expert policy trained with privileged observations and domain randomization.
  • Stage 2 (Online Distillation): a deployable student policy distilled from the teacher under partial observability and sparse commands.

These checkpoints are intended for evaluation and downstream experiments in IsaacLab and Genesis settings.

Model Contents

Typical release artifact:

  • your_checkpoint.ckpt: pretrained HoRD checkpoint for evaluation or fine-tuning.

Quick Start

Install Hugging Face CLI:

pip install -U "huggingface_hub[cli]"

Download a checkpoint from the model repository:

mkdir -p results
huggingface-cli download tony0517/HoRD your_checkpoint.ckpt --local-dir results --local-dir-use-symlinks False

Use in HoRD evaluation:

+checkpoint=results/your_checkpoint.ckpt

Example Evaluation Command

python hord/eval_agent.py +exp=full_body_tracker/transformer +robot=g1 +simulator=isaaclab +motion_file=data/train_g1_all.pt +experiment_name=full_body_tracker ++headless=False +checkpoint=results/your_checkpoint.ckpt ++num_envs=1

License

This model card is released under the MIT License.

Citation

If you find these checkpoints useful, please cite:

@article{wang2026hord,
  title={HoRD: Robust Humanoid Control via History-Conditioned Reinforcement Learning and Online Distillation},
  author={Wang, Puyue and Hu, Jiawei and Gao, Yan and Wang, Junyan and Zhang, Yu and Dobbie, Gillian and Gu, Tao and Johal, Wafa and Dang, Ting and Jia, Hong},
  journal={arXiv preprint arXiv:2602.04412},
  year={2026}
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Video Preview
loading

Paper for tony0517/HoRD