ppo_second / results.json
zhanghwei's picture
test1
7926ba1
raw
history blame contribute delete
165 Bytes
{"mean_reward": -123.26049350851682, "std_reward": 50.36264513845567, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-05-22T17:26:17.971639"}