smileynet commited on
Commit
0774da5
·
verified ·
1 Parent(s): 1d67981

Upload model files

Browse files

Uploading model files including README, and metadata

Files changed (3) hide show
  1. README.md +2 -2
  2. replay.mp4 +0 -0
  3. results.json +3 -3
README.md CHANGED
@@ -7,7 +7,7 @@ model-index:
7
  - metrics:
8
  - name: mean_reward
9
  type: mean_reward
10
- value: 273.11 +/- 18.97
11
  task:
12
  name: LunarLander-v2
13
  type: reinforcement-learning
@@ -70,4 +70,4 @@ The model was trained using the following hyperparameters:
70
  ```
71
 
72
  ## Results
73
- The trained agent achieved a mean reward of 273.11 +/- 18.97 over 10 evaluation episodes.
 
7
  - metrics:
8
  - name: mean_reward
9
  type: mean_reward
10
+ value: 269.36 +/- 28.12
11
  task:
12
  name: LunarLander-v2
13
  type: reinforcement-learning
 
70
  ```
71
 
72
  ## Results
73
+ The trained agent achieved a mean reward of 269.36 +/- 28.12 over 10 evaluation episodes.
replay.mp4 CHANGED
Binary files a/replay.mp4 and b/replay.mp4 differ
 
results.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "env_id": "LunarLander-v2",
3
- "mean_reward": 273.1133084,
4
- "std_reward": 18.969926691445075,
5
  "n_eval_episodes": 10,
6
- "eval_datetime": "2024-09-23T21:37:57.911995"
7
  }
 
1
  {
2
  "env_id": "LunarLander-v2",
3
+ "mean_reward": 269.35587849999996,
4
+ "std_reward": 28.121482914355596,
5
  "n_eval_episodes": 10,
6
+ "eval_datetime": "2024-09-23T22:23:46.583076"
7
  }