ccnets/causal-gpt-rl

Reinforcement Learning
Safetensors
offline-rl
mujoco
gpt
llama
autoregressive
causal-gpt-rl
Repository size: 45.7 MB
  • 1 contributor
History: 10 commits
Latest commit by kissin42: Update bundle return table (re-measured 5ep seed=0 CPU; reflects new ant-v5 & humanoid-v5 bundles)
3248668 (verified), about 2 hours ago
  • ant-v5
    Update ant-v5 bundle (50ep eval: mean 1914 -> 2265, median 1562 -> 1969; context_length 24 -> 16) about 15 hours ago
  • halfcheetah-v5
    Add halfcheetah-v5 bundle 1 day ago
  • humanoid-v5
    Update humanoid-v5 bundle (50ep eval: mean 3058 -> 4571, median 2330 -> 4882, surv 3/50 -> 15/50; context_length 24 -> 32) about 2 hours ago
  • walker2d-v5
    Add walker2d-v5 bundle 1 day ago
  • .gitattributes
    1.52 kB
    initial commit 3 days ago
  • README.md
    2.07 kB
    Update bundle return table (re-measured 5ep seed=0 CPU; reflects new ant-v5 & humanoid-v5 bundles) about 2 hours ago