First implementation of PPO method with lunar lander environment. Reinforcement Learning course. eaaa1ab victormmp1 committed on Dec 12, 2022