ppo-lunarlander-v2 / initial_model /policy.optimizer.pth

Commit History

PPO trained on the Lunar Lander module
e0a7b24

culteejen commited on