PPO Agent Playing LunarLander-v3

이 λͺ¨λΈμ€ PPO(Proximal Policy Optimization) μ•Œκ³ λ¦¬μ¦˜μ„ λ°‘λ°”λ‹₯λΆ€ν„° 직접 κ΅¬ν˜„ν•˜μ—¬ ν•™μŠ΅μ‹œν‚¨ LunarLander-v3 μ—μ΄μ „νŠΈμž…λ‹ˆλ‹€.

λ¦¬ν”Œλ ˆμ΄ μ˜μƒ

μ—μ΄μ „νŠΈ ν”Œλ ˆμ΄

ν•™μŠ΅ 정보

  • Algorithm: PPO
  • Environment: LunarLander-v3
  • Framework: PyTorch
Downloads last month

-

Downloads are not tracked for this model. How to track
Video Preview
loading

Evaluation results