pre63/entropy-trpo-weights

Reinforcement Learning
entropy-trpo
trpo
policy-optimization
mujoco
cartpole
  • 1 contributor
History: 9492 commits
pre63
Update model card and training summary
28d884b verified 1 minute ago
  • CartPole-v1
    CartPole-v1/ppo/seed_2/latest checkpoint epoch 31250 1 day ago
  • Humanoid-v5
    Humanoid-v5/ppo/seed_2/latest checkpoint epoch 1953 1 day ago
  • HumanoidStandup-v5
    HumanoidStandup-v5/ero_trpo/seed_2/latest checkpoint epoch 1140 1 minute ago
  • benchmark_e3_1_3m
    benchmark_e3_1_3m/CartPole-v1/trpo/seed_8/latest checkpoint epoch 3240 2 days ago
  • benchmark_e3_1m
    benchmark_e3_1m/CartPole-v1/constant_erc_trpo/seed_2/latest checkpoint epoch 2000 2 days ago
  • benchmark_e3_3_xu_repro
    benchmark_e3_3_xu_repro/CartPole-v1/erc_trpo/latest checkpoint epoch 6000 2 days ago
  • benchmark_e3_3m
    benchmark_e3_3m/CartPole-v1/erc_trpo/seed_0/latest checkpoint epoch 1350 2 days ago
  • benchmark_e3_4_env
    benchmark_e3_4_env/CartPole-v1/erc_trpo/latest checkpoint epoch 6000 2 days ago
  • benchmark_e4_cartpole
    benchmark_e4_cartpole/CartPole-v1/random_ero_trpo/seed_2/latest checkpoint epoch 100 2 days ago
  • benchmark_e5_xu_elim
    benchmark_e5_xu_elim/CartPole-v1/erc_trpo/seed_0/latest checkpoint epoch 4700 2 days ago
  • results
    Update model card and training summary 1 minute ago
  • .gitattributes
    1.52 kB
    initial commit 11 days ago
  • README.md
    14.5 kB
    Update model card and training summary 1 minute ago