saifyxpro committed · Commit ee1d620 · verified · 1 Parent(s): b7ffaf6

Update README.md

Files changed (1)
  1. README.md +7 -19
README.md CHANGED
@@ -1,7 +1,7 @@
  ---
  library_name: stable-baselines3
  tags:
- - LunarLander-v3
+ - LunarLander-v2
  - deep-reinforcement-learning
  - reinforcement-learning
  - stable-baselines3
@@ -12,26 +12,14 @@ model-index:
  type: reinforcement-learning
  name: reinforcement-learning
  dataset:
- name: LunarLander-v3
- type: LunarLander-v3
+ name: LunarLander-v2
+ type: LunarLander-v2
  metrics:
  - type: mean_reward
- value: 249.17 +/- 22.09
+ value: 260.46 +/- 15.40
  name: mean_reward
  verified: false
  ---
-
- # **PPO** Agent playing **LunarLander-v3**
- This is a trained model of a **PPO** agent playing **LunarLander-v3**
- using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
-
- ## Usage (with Stable-baselines3)
- TODO: Add your code
-
-
- ```python
- from stable_baselines3 import ...
- from huggingface_sb3 import load_from_hub
-
- ...
- ```
+ # PPO Agent playing LunarLander-v2
+ This is a trained model of a PPO agent playing LunarLander-v2.
+ Mean reward: 260.46 +/- 15.40
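
The `mean_reward` value in this diff has the form "mean +/- spread". As a minimal sketch of how such a figure could be produced, the snippet below averages per-episode returns and reports the population standard deviation. This assumes the same convention as a NumPy-style `mean`/`std` over evaluation episodes. The helper names `summarize_rewards` and `format_mean_reward` and the reward list are hypothetical and are not taken from this repository.

```python
import statistics


def summarize_rewards(episode_rewards):
    """Return (mean, std) of per-episode returns.

    Assumption: population standard deviation (divide by N),
    matching the default behaviour of numpy.std.
    """
    mean = statistics.fmean(episode_rewards)
    std = statistics.pstdev(episode_rewards)
    return mean, std


def format_mean_reward(mean, std):
    """Format the pair in the 'mean +/- std' style used in the model card."""
    return f"{mean:.2f} +/- {std:.2f}"


# Hypothetical per-episode returns, for illustration only
# (not the evaluation data behind the figures in this commit).
example_rewards = [250.0, 270.0, 260.0, 240.0, 280.0]

mean, std = summarize_rewards(example_rewards)
print(format_mean_reward(mean, std))  # prints "260.00 +/- 14.14"
```

A smaller spread after the `+/-`, as in the change from 22.09 to 15.40 here, would indicate more consistent returns across evaluation episodes, provided both figures were measured the same way.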