bguan/lunar_lander_v2_ppo_5

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning

Eval Results (legacy)




1 contributor

History: 3 commits


lunar lander model #5, using PPO trained with learning rate 0.0005, gamma 0.995, for 1M timesteps

57e96c5 about 4 years ago
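
The commit message gives three training settings: learning rate 0.0005, gamma 0.995, and 1M timesteps. A minimal sketch of how such a run might look with stable-baselines3 follows. Only those three values come from the listing. The environment id `LunarLander-v2` (inferred from the repository name), the `MlpPolicy` choice, the save path, and the helper name `train` are assumptions, and all other PPO settings are left at library defaults, which may differ from what the author used.

```python
# Values stated in the commit message.
HYPERPARAMS = {
    "learning_rate": 0.0005,
    "gamma": 0.995,
}
TOTAL_TIMESTEPS = 1_000_000


def train(env_id="LunarLander-v2", save_path="bguan_ppo_lunarlander5"):
    """Hypothetical helper: train PPO with the settings above and save it.

    env_id, the MlpPolicy choice, and save_path are assumptions, not taken
    from the repository listing.
    """
    # Imported lazily so the constants can be inspected without
    # stable-baselines3 (and the Box2D environment) installed.
    from stable_baselines3 import PPO

    model = PPO("MlpPolicy", env_id, verbose=1, **HYPERPARAMS)
    model.learn(total_timesteps=TOTAL_TIMESTEPS)
    model.save(save_path)  # stable-baselines3 appends ".zip" to the path
    return model
```

Calling `train()` would start the full 1M-step run, so it is left out of the sketch itself.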

Files (each entry shows the same last commit message, "lunar lander model #5, using PPO trained with learning rate 0.0005, gamma 0.995, for 1M timesteps", about 4 years ago):

bguan_ppo_lunarlander5
.gitattributes    1.22 kB
README.md    677 Bytes
bguan_ppo_lunarlander5.zip    145 kB    (xet; pickle imports: no problematic imports detected)
config.json    15 kB
replay.mp4    247 kB    (xet)
results.json    163 Bytes
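
The listing includes the saved agent as `bguan_ppo_lunarlander5.zip`. One plausible way to fetch and load it is sketched below, assuming the `huggingface_sb3` helper package and stable-baselines3 are installed. The helper name `load_agent` is hypothetical. Because the file is about four years old, loading it with a recent stable-baselines3 release may need extra care (for example the `custom_objects` argument of `PPO.load`), which this sketch does not handle.

```python
# Repository id and file name as they appear in the listing.
REPO_ID = "bguan/lunar_lander_v2_ppo_5"
FILENAME = "bguan_ppo_lunarlander5.zip"


def load_agent(repo_id=REPO_ID, filename=FILENAME):
    """Hypothetical helper: download the checkpoint and load it as a PPO model."""
    # Lazy imports keep the constants usable without the RL stack installed.
    from huggingface_sb3 import load_from_hub
    from stable_baselines3 import PPO

    # load_from_hub downloads the file and returns its local path.
    checkpoint_path = load_from_hub(repo_id=repo_id, filename=filename)
    return PPO.load(checkpoint_path)
```

The returned model can then be used with `model.predict(observation)` inside an environment loop.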