Sync README with repo: drop device=cpu, drop ?nw, add return numbers
README.md
CHANGED
@@ -26,12 +26,14 @@ Stable under self-generated rollouts — long-horizon control without the drift
 
 ## Bundles in this repository
 
-| Environment | Subfolder |
-|---|---|
-| Ant-v5 | `ant-v5` |
-| HalfCheetah-v5 | `halfcheetah-v5` |
-| Walker2d-v5 | `walker2d-v5` |
-| Humanoid-v5 | `humanoid-v5` |
+| Environment | Subfolder | Return (mean ± std) |
+|---|---|---|
+| Ant-v5 | `ant-v5` | 3033 ± 895 |
+| HalfCheetah-v5 | `halfcheetah-v5` | 2066 ± 2776 |
+| Walker2d-v5 | `walker2d-v5` | 2961 ± 756 |
+| Humanoid-v5 | `humanoid-v5` | 3634 ± 2152 |
+
+Returns are over 5 episodes with `seed=0`, run on CPU via `run_episodes`.
 
 ## Quick Start
 
@@ -47,7 +49,6 @@ env = gym.make("Ant-v5")
 runner = load_runner_from_hub(
     repo_id="ccnets/causal-gpt-rl",
     subfolder="ant-v5",
-    device="cpu",
 )
 stats = run_episodes(env, runner, num_episodes=5, seed=0)
 print(stats["return_mean"], stats["return_std"])
@@ -64,7 +65,7 @@ Each subfolder contains:
 ## Links
 
 - **Code:** [github.com/ccnets-team/causal-gpt-rl](https://github.com/ccnets-team/causal-gpt-rl)
-- **Training logs (W&B):** [wandb.ai/junhopark/Causal GPT-RL](https://wandb.ai/junhopark/Causal%20GPT-RL
+- **Training logs (W&B):** [wandb.ai/junhopark/Causal GPT-RL](https://wandb.ai/junhopark/Causal%20GPT-RL)
 - **Website:** [ccnets.org](https://ccnets.org)
 
 ## License
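
Note on the new "Return (mean ± std)" column: the README says each cell summarizes 5 episodes from `run_episodes`, which exposes `return_mean` and `return_std`. The sketch below shows one way such a cell could be derived from per-episode returns. It is an assumption, not the repository's code: `summarize_returns` and `format_table_cell` are hypothetical helpers, the population standard deviation is a guess at the convention used, and the example returns are made up for illustration rather than taken from any run.

```python
from statistics import fmean, pstdev


def summarize_returns(episode_returns):
    """Aggregate per-episode returns into a mean and standard deviation.

    Hypothetical helper: the key names mirror the `stats` dict printed in
    the Quick Start, but the std convention (population, not sample) is
    an assumption.
    """
    if not episode_returns:
        raise ValueError("need at least one episode return")
    return {
        "return_mean": fmean(episode_returns),
        "return_std": pstdev(episode_returns),
    }


def format_table_cell(stats):
    """Render the summary the way the README table does, rounded to integers."""
    return f"{stats['return_mean']:.0f} ± {stats['return_std']:.0f}"


# Made-up returns for illustration only; not results from any environment.
example_returns = [1000.0, 2000.0, 3000.0, 4000.0, 5000.0]
stats = summarize_returns(example_returns)
cell = format_table_cell(stats)
print(cell)
```

With only 5 episodes the spread can be comparable to the mean, as in the HalfCheetah-v5 row (2066 ± 2776), so stating the episode count and seed next to the table keeps the numbers interpretable.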