PPO Agent Playing LunarLander-v3
μ΄ λͺ¨λΈμ PPO(Proximal Policy Optimization) μκ³ λ¦¬μ¦μ λ°λ°λ₯λΆν° μ§μ ꡬννμ¬ νμ΅μν¨ LunarLander-v3 μμ΄μ νΈμ
λλ€.
리νλ μ΄ μμ

νμ΅ μ 보
- Algorithm: PPO
- Environment: LunarLander-v3
- Framework: PyTorch