Commit fd2a056 · verified
1 Parent(s): aed5cca

Create README.md

Files changed (1)
  1. README.md +33 -0
README.md ADDED
@@ -0,0 +1,33 @@
+ ---
+ library_name: pytorch
+ tags:
+ - reinforcement-learning
+ - deep-reinforcement-learning
+ - ppo
+ - LunarLander-v3
+ model-index:
+ - name: PPO
+   results:
+   - task:
+       type: reinforcement-learning
+       name: Reinforcement Learning
+     dataset:
+       name: LunarLander-v3
+       type: LunarLander-v3
+     metrics:
+     - type: mean_reward
+       value: 200.0 +/- 50.0
+       name: mean_reward
+ ---
+
+ # PPO Agent Playing LunarLander-v3
+
+ This model is a LunarLander-v3 agent trained with a PPO (Proximal Policy Optimization) algorithm implemented from scratch.
+
+ ## Replay Video
+ ![Agent gameplay](replay.mp4)
+
+ ## Training Information
+ - **Algorithm**: PPO
+ - **Environment**: LunarLander-v3
+ - **Framework**: PyTorch
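
To illustrate the PPO algorithm the README names, here is a minimal sketch of the clipped surrogate objective at its core. It is not taken from this repository's implementation. The function names `ppo_clip_objective` and `ppo_policy_loss` and the default `clip_eps=0.2` are assumptions chosen for illustration, and plain Python floats stand in for the PyTorch tensors a real training loop would presumably use.

```python
import math


def ppo_clip_objective(new_logp, old_logp, advantage, clip_eps=0.2):
    """Clipped surrogate objective for a single (state, action) sample.

    new_logp / old_logp are log-probabilities of the taken action under the
    current and the data-collecting policy. All names here are hypothetical.
    """
    # Probability ratio pi_new(a|s) / pi_old(a|s), computed in log space.
    ratio = math.exp(new_logp - old_logp)
    # Restrict the ratio to the interval [1 - eps, 1 + eps].
    clipped_ratio = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
    # Take the more pessimistic of the unclipped and clipped terms.
    return min(ratio * advantage, clipped_ratio * advantage)


def ppo_policy_loss(new_logps, old_logps, advantages, clip_eps=0.2):
    """Negative mean clipped objective over a batch, suitable for minimizing."""
    terms = [
        ppo_clip_objective(n, o, a, clip_eps)
        for n, o, a in zip(new_logps, old_logps, advantages)
    ]
    return -sum(terms) / len(terms)
```

When the two policies agree, the ratio is 1 and the objective reduces to the advantage itself. The clipping only takes effect once the new policy has moved far enough from the old one that the ratio leaves the `[1 - clip_eps, 1 + clip_eps]` band.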