rl-class-1 / ppo_lunar_lander_tutorial / policy.optimizer.pth

Commit History

Second training with 1M steps
e7f6ddc

Oleg Jarma Montoya committed on

Base version of the Lunar Lander agent
d687139

Oleg Jarma Montoya committed on