ppo-LunarLander-v2-v4 / results.json
Aditya Hemant Majali
Push PPO trained LunarLander agent v2
aef361d
raw
history blame contribute delete
164 Bytes
{"mean_reward": 266.1479891339534, "std_reward": 19.334277329233842, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-03-26T18:25:46.670062"}