DRL / results.json
azib's picture
PPO model for lunarlander environment
c552edf verified
{"mean_reward": 251.78434320000002, "std_reward": 19.958410349003753, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2025-06-08T17:36:26.252430"}