ppo-LunarLander-v2 / results.json
Yusufhan's picture
Deep reinforcement learning algorithm using PPO
c82e914 verified
raw
history blame contribute delete
164 Bytes
{"mean_reward": 244.35020679999997, "std_reward": 27.55104126482555, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2024-08-17T11:32:37.808801"}