proyrb commited on
Commit
50ed6f4
·
verified ·
1 Parent(s): e666203

Push agent to the Hub

Browse files
README.md CHANGED
@@ -17,14 +17,14 @@ model-index:
17
  type: LunarLander-v2
18
  metrics:
19
  - type: mean_reward
20
- value: -181.90 +/- 126.29
21
  name: mean_reward
22
  verified: false
23
  ---
24
  # PPO Agent Playing LunarLander-v2
25
  This is a trained model of a PPO agent playing LunarLander-v2.
26
  ## Evaluation Results
27
- - Mean Reward: -181.90 ± 126.29
28
  - Number of Evaluation Episodes: 10
29
  ## Hyperparameters
30
  ```python
 
17
  type: LunarLander-v2
18
  metrics:
19
  - type: mean_reward
20
+ value: -223.63 +/- 133.13
21
  name: mean_reward
22
  verified: false
23
  ---
24
  # PPO Agent Playing LunarLander-v2
25
  This is a trained model of a PPO agent playing LunarLander-v2.
26
  ## Evaluation Results
27
+ - Mean Reward: -223.63 ± 133.13
28
  - Number of Evaluation Episodes: 10
29
  ## Hyperparameters
30
  ```python
logs/events.out.tfevents.1750047698.2f208e49a865 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3d3a5a19b0779055fa417104f3eef84eb0bb7de7be90adc1e1b1be14069e3c84
3
+ size 433186
results.json CHANGED
@@ -1 +1 @@
1
- {"env_id": "LunarLander-v2", "mean_reward": -181.89562423336105, "std_reward": 126.2925732280044, "n_evaluation_episodes": 10, "eval_datetime": "2025-06-16T04:19:28.798775"}
 
1
+ {"env_id": "LunarLander-v2", "mean_reward": -223.6344459904147, "std_reward": 133.12613218024208, "n_evaluation_episodes": 10, "eval_datetime": "2025-06-16T04:22:31.860447"}