Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -18,6 +18,10 @@ license: mit
|
|
| 18 |
|
| 19 |
_Self-trained DR.Q checkpoints that **beat** the public dmux/DR.Q baseline on HumanoidBench locomotion tasks._
|
| 20 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 21 |
DR.Q 是 TD3 + model-based 表征学习的离策略 RL 算法(encoder + policy ~13 MB 推理)。
|
| 22 |
本仓库收录在 [HumanoidBench](https://github.com/carlosferrazza/humanoid-bench) 上**从零自训通关**的 checkpoints。
|
| 23 |
|
|
|
|
| 18 |
|
| 19 |
_Self-trained DR.Q checkpoints that **beat** the public dmux/DR.Q baseline on HumanoidBench locomotion tasks._
|
| 20 |
|
| 21 |
+
> 🛠 **训练源码 / Training source**: <https://github.com/vitorcen/humanoid-training>
|
| 22 |
+
> 完整训练脚本、patches、eval harness、分析文档全在 GitHub 配套仓库。
|
| 23 |
+
> _Full training scripts, patches, eval harness, and analysis docs in the companion GitHub repo._
|
| 24 |
+
|
| 25 |
DR.Q 是 TD3 + model-based 表征学习的离策略 RL 算法(encoder + policy ~13 MB 推理)。
|
| 26 |
本仓库收录在 [HumanoidBench](https://github.com/carlosferrazza/humanoid-bench) 上**从零自训通关**的 checkpoints。
|
| 27 |
|