kissin42 committed · Commit 3fa83dd · verified · 1 Parent(s): db203c7

Sync README with repo: drop device=cpu, drop ?nw, add return numbers

Files changed (1)
  1. README.md +9 -8
README.md CHANGED
@@ -26,12 +26,14 @@ Stable under self-generated rollouts — long-horizon control without the drift
 
 ## Bundles in this repository
 
-| Environment | Subfolder |
-|---|---|
-| Ant-v5 | `ant-v5` |
-| HalfCheetah-v5 | `halfcheetah-v5` |
-| Walker2d-v5 | `walker2d-v5` |
-| Humanoid-v5 | `humanoid-v5` |
+| Environment | Subfolder | Return (mean ± std) |
+|---|---|---|
+| Ant-v5 | `ant-v5` | 3033 ± 895 |
+| HalfCheetah-v5 | `halfcheetah-v5` | 2066 ± 2776 |
+| Walker2d-v5 | `walker2d-v5` | 2961 ± 756 |
+| Humanoid-v5 | `humanoid-v5` | 3634 ± 2152 |
+
+Returns are over 5 episodes with `seed=0`, run on CPU via `run_episodes`.
 
 ## Quick Start
 
@@ -47,7 +49,6 @@ env = gym.make("Ant-v5")
 runner = load_runner_from_hub(
     repo_id="ccnets/causal-gpt-rl",
     subfolder="ant-v5",
-    device="cpu",
 )
 stats = run_episodes(env, runner, num_episodes=5, seed=0)
 print(stats["return_mean"], stats["return_std"])
@@ -64,7 +65,7 @@ Each subfolder contains:
 ## Links
 
 - **Code:** [github.com/ccnets-team/causal-gpt-rl](https://github.com/ccnets-team/causal-gpt-rl)
-- **Training logs (W&B):** [wandb.ai/junhopark/Causal GPT-RL](https://wandb.ai/junhopark/Causal%20GPT-RL?nw)
+- **Training logs (W&B):** [wandb.ai/junhopark/Causal GPT-RL](https://wandb.ai/junhopark/Causal%20GPT-RL)
 - **Website:** [ccnets.org](https://ccnets.org)
 
 ## License
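
The "Return (mean ± std)" column added in this commit summarizes per-episode returns. The sketch below shows one way such a summary could be computed. It is not the repository's `run_episodes` implementation: the helper `summarize_returns`, the example return values, and the choice of population standard deviation are all assumptions made for illustration. Only the key names `return_mean` and `return_std` come from the README snippet.

```python
import statistics


def summarize_returns(episode_returns):
    """Aggregate per-episode returns into a mean and a standard deviation.

    Hypothetical helper: the key names mirror the README's `stats` dict,
    but the real `run_episodes` may compute these values differently.
    """
    if not episode_returns:
        raise ValueError("need at least one episode return")
    return {
        "return_mean": statistics.fmean(episode_returns),
        # Assumption: population std (divide by N). A library could instead
        # report the sample std (divide by N - 1), which is larger for N = 5.
        "return_std": statistics.pstdev(episode_returns),
    }


# Made-up returns for five episodes, for illustration only.
example_returns = [10.0, 20.0, 30.0, 40.0, 50.0]
stats = summarize_returns(example_returns)
print(f'{stats["return_mean"]:.0f} +/- {stats["return_std"]:.0f}')
```

With only five episodes the population and sample estimates differ noticeably, so a reader comparing against the table would need to check which convention the repository uses.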