Rl-Unit1-CoLab / ppo-LunarLander-v2 / _stable_baselines3_version
First try of Unit 1 CoLab with PPO trained with 1 million steps
33ba93c
2.0.0a5