pre63/entropy-trpo-weights
Tags: Reinforcement Learning, entropy-trpo, trpo, policy-optimization, mujoco, cartpole
arXiv: 2110.13373
License: mit
1 contributor
History: 9492 commits
Latest commit: pre63, "Update model card and training summary" (28d884b, verified), 1 minute ago
Directories (name | last commit message | updated)
CartPole-v1 | CartPole-v1/ppo/seed_2/latest checkpoint epoch 31250 | 1 day ago
Humanoid-v5 | Humanoid-v5/ppo/seed_2/latest checkpoint epoch 1953 | 1 day ago
HumanoidStandup-v5 | HumanoidStandup-v5/ero_trpo/seed_2/latest checkpoint epoch 1140 | 1 minute ago
benchmark_e3_1_3m | benchmark_e3_1_3m/CartPole-v1/trpo/seed_8/latest checkpoint epoch 3240 | 2 days ago
benchmark_e3_1m | benchmark_e3_1m/CartPole-v1/constant_erc_trpo/seed_2/latest checkpoint epoch 2000 | 2 days ago
benchmark_e3_3_xu_repro | benchmark_e3_3_xu_repro/CartPole-v1/erc_trpo/latest checkpoint epoch 6000 | 2 days ago
benchmark_e3_3m | benchmark_e3_3m/CartPole-v1/erc_trpo/seed_0/latest checkpoint epoch 1350 | 2 days ago
benchmark_e3_4_env | benchmark_e3_4_env/CartPole-v1/erc_trpo/latest checkpoint epoch 6000 | 2 days ago
benchmark_e4_cartpole | benchmark_e4_cartpole/CartPole-v1/random_ero_trpo/seed_2/latest checkpoint epoch 100 | 2 days ago
benchmark_e5_xu_elim | benchmark_e5_xu_elim/CartPole-v1/erc_trpo/seed_0/latest checkpoint epoch 4700 | 2 days ago
results | Update model card and training summary | 1 minute ago

Files (name | size | last commit message | updated)
.gitattributes | 1.52 kB (Safe) | initial commit | 11 days ago
README.md | 14.5 kB | Update model card and training summary | 1 minute ago