wsagi committed 91ecef7 (verified) · 1 parent: fe8aed0

Upload README.md with huggingface_hub

  1. README.md +4 -0
README.md CHANGED
@@ -18,6 +18,10 @@ license: mit
 
 _Self-trained DR.Q checkpoints that **beat** the public dmux/DR.Q baseline on HumanoidBench locomotion tasks._
 
+> 🛠 **训练源码 / Training source**: <https://github.com/vitorcen/humanoid-training>
+> 完整训练脚本、patches、eval harness、分析文档全在 GitHub 配套仓库。
+> _Full training scripts, patches, eval harness, and analysis docs in the companion GitHub repo._
+
 DR.Q 是 TD3 + model-based 表征学习的离策略 RL 算法(encoder + policy ~13 MB 推理)。
 本仓库收录在 [HumanoidBench](https://github.com/carlosferrazza/humanoid-bench) 上**从零自训通关**的 checkpoints。
 