DeepRL-Class / ppo-LunarLander-v2 /_stable_baselines3_version
Ricardmc99's picture
PPO trained agent on 500,000 timesteps
e8e9073
1.6.0