smileynet commited on
Commit
a1c57ee
·
verified ·
1 Parent(s): 2deea95

Upload model files

Browse files

Uploading model files including README, and metadata

Files changed (4) hide show
  1. README.md +2 -2
  2. ppo-LunarLander-v2.zip +3 -0
  3. replay.mp4 +0 -0
  4. results.json +3 -3
README.md CHANGED
@@ -7,7 +7,7 @@ model-index:
7
  - metrics:
8
  - name: mean_reward
9
  type: mean_reward
10
- value: 277.18 +/- 18.43
11
  task:
12
  name: LunarLander-v2
13
  type: reinforcement-learning
@@ -70,4 +70,4 @@ The model was trained using the following hyperparameters:
70
  ```
71
 
72
  ## Results
73
- The trained agent achieved a mean reward of 277.18 +/- 18.43 over 10 evaluation episodes.
 
7
  - metrics:
8
  - name: mean_reward
9
  type: mean_reward
10
+ value: 273.11 +/- 18.97
11
  task:
12
  name: LunarLander-v2
13
  type: reinforcement-learning
 
70
  ```
71
 
72
  ## Results
73
+ The trained agent achieved a mean reward of 273.11 +/- 18.97 over 10 evaluation episodes.
ppo-LunarLander-v2.zip ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6da70a465c67a86e6a12936884f257da97fd2f90121d81694bbfbe0f3add3848
3
+ size 153864
replay.mp4 CHANGED
Binary files a/replay.mp4 and b/replay.mp4 differ
 
results.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "env_id": "LunarLander-v2",
3
- "mean_reward": 277.17615870000003,
4
- "std_reward": 18.428258312403294,
5
  "n_eval_episodes": 10,
6
- "eval_datetime": "2024-09-23T10:34:50.992509"
7
  }
 
1
  {
2
  "env_id": "LunarLander-v2",
3
+ "mean_reward": 273.1133084,
4
+ "std_reward": 18.969926691445075,
5
  "n_eval_episodes": 10,
6
+ "eval_datetime": "2024-09-23T21:37:57.911995"
7
  }