bguan/lunar_lander_v2_ppo_5

Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results (legacy)
lunar_lander_v2_ppo_5
553 kB
  • 1 contributor
History: 3 commits
bguan
lunar lander model #5, using PPO trained with learning rate 0.0005, gamma 0.995, for 1M timesteps
57e96c5 over 3 years ago
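
The commit message names three training settings: learning rate 0.0005, gamma 0.995, and 1M timesteps. The sketch below records them and shows how they might be passed to Stable-Baselines3. It is not the author's training script: the `PPOSettings` class, the `effective_horizon` helper, the `train` function, and the choice of `"MlpPolicy"` are all assumptions made for illustration, and every other PPO argument is left at the library default.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PPOSettings:
    """Hyperparameters quoted in the commit message (class name is hypothetical)."""
    learning_rate: float = 0.0005
    gamma: float = 0.995
    total_timesteps: int = 1_000_000
    env_id: str = "LunarLander-v2"  # taken from the repository tags


def effective_horizon(gamma: float) -> float:
    """Rough number of future steps that matter under discount factor gamma."""
    return 1.0 / (1.0 - gamma)


def train(settings: PPOSettings = PPOSettings()):
    """Train a PPO agent with the settings above (assumed policy: MlpPolicy)."""
    # Imported lazily so the settings can be inspected without the RL stack.
    from stable_baselines3 import PPO

    # SB3 accepts an environment id string and builds the environment itself.
    model = PPO(
        "MlpPolicy",  # assumption: the page does not state the policy type
        settings.env_id,
        learning_rate=settings.learning_rate,
        gamma=settings.gamma,
        verbose=0,
    )
    model.learn(total_timesteps=settings.total_timesteps)
    return model
```

With gamma 0.995, `effective_horizon` gives about 200 steps, so rewards roughly 200 steps ahead still carry noticeable weight. Calling `train()` requires Stable-Baselines3, the Box2D extras, and a Gym release that still registers `LunarLander-v2`.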
Every entry below was last changed over 3 years ago by the commit "lunar lander model #5, using PPO trained with learning rate 0.0005, gamma 0.995, for 1M timesteps".

  • bguan_ppo_lunarlander5
  • .gitattributes (1.22 kB)
  • README.md (677 Bytes)
  • bguan_ppo_lunarlander5.zip (145 kB; pickle imports: no problematic imports detected)
  • config.json (15 kB)
  • replay.mp4 (247 kB)
  • results.json (163 Bytes)
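
The listing includes a saved checkpoint, `bguan_ppo_lunarlander5.zip`. One plausible way to fetch and load it is through the `huggingface_sb3` helper package, sketched below. The `load_agent` function name is hypothetical, and the sketch assumes the archive is a standard Stable-Baselines3 PPO save; a checkpoint of this age may need extra arguments to `PPO.load` under newer library versions.

```python
# Repository id and file name as they appear on this page.
REPO_ID = "bguan/lunar_lander_v2_ppo_5"
FILENAME = "bguan_ppo_lunarlander5.zip"


def load_agent(repo_id: str = REPO_ID, filename: str = FILENAME):
    """Download the checkpoint from the Hub and load it as a PPO model."""
    # Imported lazily so this module can be imported without the RL stack.
    from huggingface_sb3 import load_from_hub
    from stable_baselines3 import PPO

    # load_from_hub returns the local path of the downloaded file.
    checkpoint_path = load_from_hub(repo_id=repo_id, filename=filename)
    return PPO.load(checkpoint_path)
```

The returned model exposes `predict(observation)` for running the agent in an environment.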