ppo-LunarLander-v2 / results.json
new trained PPO model
fd6bba1
163 Bytes
{"mean_reward": 265.3857363575706, "std_reward": 24.31124025279481, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-12-24T11:01:40.475174"}
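A minimal sketch of how these evaluation results might be read and summarized. It assumes the JSON above is available as a string (the `RESULTS_JSON` name is hypothetical), and the "mean minus one standard deviation" summary is an illustrative choice, not something the file itself specifies.

```python
import json

# Contents of results.json, copied verbatim from the file above.
# RESULTS_JSON is a hypothetical name used only for this sketch.
RESULTS_JSON = (
    '{"mean_reward": 265.3857363575706, "std_reward": 24.31124025279481, '
    '"is_deterministic": true, "n_eval_episodes": 10, '
    '"eval_datetime": "2022-12-24T11:01:40.475174"}'
)

results = json.loads(RESULTS_JSON)

# Assumption: summarize the run with a conservative score,
# the mean reward minus one standard deviation.
lower_bound = results["mean_reward"] - results["std_reward"]

print(
    f"{results['n_eval_episodes']} episodes, "
    f"deterministic={results['is_deterministic']}: "
    f"{results['mean_reward']:.2f} +/- {results['std_reward']:.2f} "
    f"(mean - std = {lower_bound:.2f})"
)
```

With only 10 evaluation episodes the standard deviation is itself a rough estimate, so the summary is best read as indicative.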