ncduy committed · Commit 22b5a33 · verified · 1 Parent(s): 646f09e

Upload PPO LunarLander-v3 trained agent

Files changed (5)
  1. .gitattributes +1 -0
  2. README.md +5 -2
  3. ppo-LunarLander-v3.zip +1 -1
  4. replay.mp4 +3 -0
  5. results.json +1 -1
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ replay.mp4 filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -16,16 +16,19 @@ model-index:
  type: LunarLander-v3
  metrics:
  - type: mean_reward
- value: 262.11 +/- 18.76
+ value: 249.05 +/- 22.84
  name: mean_reward
  verified: false
  ---
 
  # **PPO** Agent playing **LunarLander-v3**
+
  This is a trained model of a **PPO** agent playing **LunarLander-v3**
  using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
 
+
  ## Usage (with Stable-baselines3)
+
  TODO: Add your code
 
 
@@ -34,4 +37,4 @@ from stable_baselines3 import ...
  from huggingface_sb3 import load_from_hub
 
  ...
- ```
+ ```
ppo-LunarLander-v3.zip CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:9e08c5afd64ac052ba260848d4d560e108cf76feed6667b7828dc1c723a7d495
+ oid sha256:0f8d4ee68f5c69a211490827c47e00a3bed50fd7442aabc851341dc1ed7212d3
  size 150018
replay.mp4 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:13a189a937069e1241cf3929c37295290d11724831cd87b6a4f36fe77af3f523
+ size 172294
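The `.zip` and `.mp4` entries in this commit are Git LFS pointer files rather than the binaries themselves: each line is a `key value` pair (`version`, `oid`, `size`). A minimal sketch of reading one, assuming the three-line form shown in this diff; `parse_lfs_pointer` is a hypothetical helper written for illustration, not a function from git-lfs or `huggingface_hub`:

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into a dict (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>"; split on the first space only.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# Pointer for replay.mp4, copied from this commit.
REPLAY_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:13a189a937069e1241cf3929c37295290d11724831cd87b6a4f36fe77af3f523
size 172294
"""

pointer = parse_lfs_pointer(REPLAY_POINTER)
print(pointer["hash_algo"], pointer["size"])
```

The same helper applied to the two `ppo-LunarLander-v3.zip` pointers would report an identical `size` (150018) with different digests, which is what the diff above shows.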
results.json CHANGED
@@ -1 +1 @@
- {"mean_reward": 262.1123929, "std_reward": 18.760983661343836, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2025-08-29T09:28:31.203644"}
+ {"mean_reward": 249.04974040000002, "std_reward": 22.84209448961791, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2025-08-29T09:29:04.881144"}
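The `value` field changed in `README.md` lines up with the numbers in `results.json`. A small sketch of that relationship, assuming the card value is just `mean_reward` and `std_reward` rounded to two decimals; `metric_string` is a hypothetical helper, and the tooling that actually generated the card may format it differently:

```python
import json

# results.json before and after this commit, copied from the diff.
OLD_RESULTS = (
    '{"mean_reward": 262.1123929, "std_reward": 18.760983661343836, '
    '"is_deterministic": true, "n_eval_episodes": 10, '
    '"eval_datetime": "2025-08-29T09:28:31.203644"}'
)
NEW_RESULTS = (
    '{"mean_reward": 249.04974040000002, "std_reward": 22.84209448961791, '
    '"is_deterministic": true, "n_eval_episodes": 10, '
    '"eval_datetime": "2025-08-29T09:29:04.881144"}'
)


def metric_string(results_json: str) -> str:
    """Format results.json as the 'mean +/- std' string used in the model card."""
    results = json.loads(results_json)
    return f"{results['mean_reward']:.2f} +/- {results['std_reward']:.2f}"


print(metric_string(OLD_RESULTS))
print(metric_string(NEW_RESULTS))
```

Both evaluations use only 10 episodes, so a gap of this size between the two means is within what a re-evaluation can plausibly produce.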