ppo-LunarLander-v2 / results.json
Commit 202b57d (verified) by 0xSingletOnly: upload trained ppo model
{"mean_reward": 269.5760596, "std_reward": 23.46699423696974, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2025-05-26T13:00:29.369595"}
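
The fields above can be read programmatically. Below is a minimal sketch in Python that parses the same JSON and derives one conservative summary figure, mean_reward minus std_reward. Treating that difference as the score is an assumption made for illustration (the file itself defines no such field), and the names RESULTS_JSON, summarize, and lower_bound are hypothetical.

```python
import json

# Contents of results.json, copied verbatim from the file above.
RESULTS_JSON = (
    '{"mean_reward": 269.5760596, "std_reward": 23.46699423696974, '
    '"is_deterministic": true, "n_eval_episodes": 10, '
    '"eval_datetime": "2025-05-26T13:00:29.369595"}'
)


def summarize(raw: str) -> dict:
    """Parse the evaluation results and add a derived lower-bound score.

    Assumption: "lower_bound" (mean minus one standard deviation) is an
    illustrative summary, not a field defined by the original file.
    """
    results = json.loads(raw)
    mean = results["mean_reward"]
    std = results["std_reward"]
    return {
        "mean_reward": mean,
        "std_reward": std,
        "n_eval_episodes": results["n_eval_episodes"],
        "is_deterministic": results["is_deterministic"],
        "lower_bound": mean - std,
    }


summary = summarize(RESULTS_JSON)
print(f"mean reward: {summary['mean_reward']:.2f}")
print(f"std reward:  {summary['std_reward']:.2f}")
print(f"mean - std:  {summary['lower_bound']:.2f}")
```

With only 10 evaluation episodes, the standard deviation is itself a rough estimate, so the derived figure is best read as indicative rather than precise.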