Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
ccnets
/
causal-gpt-rl
like
1
Follow
CCNets, Inc.
2
Reinforcement Learning
Safetensors
offline-rl
mujoco
gpt
llama
autoregressive
causal-gpt-rl
License:
polyform-noncommercial-1.0.0
Model card
Files
Files and versions
xet
Community
main
causal-gpt-rl
45.7 MB
Ctrl+K
Ctrl+K
1 contributor
History:
10 commits
kissin42
Update bundle return table (re-measured 5ep seed=0 CPU; reflects new ant-v5 & humanoid-v5 bundles)
3248668
verified
about 2 hours ago
ant-v5
Update ant-v5 bundle (50ep eval: mean 1914 -> 2265, median 1562 -> 1969; context_length 24 -> 16)
about 15 hours ago
halfcheetah-v5
Add halfcheetah-v5 bundle
1 day ago
humanoid-v5
Update humanoid-v5 bundle (50ep eval: mean 3058 -> 4571, median 2330 -> 4882, surv 3/50 -> 15/50; context_length 24 -> 32)
about 2 hours ago
walker2d-v5
Add walker2d-v5 bundle
1 day ago
.gitattributes
Safe
1.52 kB
initial commit
3 days ago
README.md
2.07 kB
Update bundle return table (re-measured 5ep seed=0 CPU; reflects new ant-v5 & humanoid-v5 bundles)
about 2 hours ago