GuanOrg/DeepRLCourse2022

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning

Eval Results (legacy)

1.12 MB

1 contributor

History: 4 commits

bguan's lunar lander model #3 using PPO trained for 1M timesteps

ee17131 about 4 years ago

Name                        Size       Storage  Last commit                                                         Updated
bguan_ppo_lunarlander       -          -        bguan's lunar lander model using PPO trained for 500K timesteps     about 4 years ago
bguan_ppo_lunarlander2      -          -        bguan's lunar lander model #2 using PPO trained for 500K timesteps  about 4 years ago
bguan_ppo_lunarlander3      -          -        bguan's lunar lander model #3 using PPO trained for 1M timesteps    about 4 years ago
.gitattributes              1.22 kB    -        bguan's lunar lander model using PPO trained for 500K timesteps     about 4 years ago
README.md                   677 Bytes  -        bguan's lunar lander model #3 using PPO trained for 1M timesteps    about 4 years ago
bguan_ppo_lunarlander.zip   144 kB     xet      bguan's lunar lander model using PPO trained for 500K timesteps     about 4 years ago
bguan_ppo_lunarlander2.zip  144 kB     xet      bguan's lunar lander model #2 using PPO trained for 500K timesteps  about 4 years ago
bguan_ppo_lunarlander3.zip  144 kB     xet      bguan's lunar lander model #3 using PPO trained for 1M timesteps    about 4 years ago
config.json                 14.4 kB    -        bguan's lunar lander model #3 using PPO trained for 1M timesteps    about 4 years ago
replay.mp4                  245 kB     xet      bguan's lunar lander model #3 using PPO trained for 1M timesteps    about 4 years ago
results.json                165 Bytes  -        bguan's lunar lander model #3 using PPO trained for 1M timesteps    about 4 years ago