ppo-LunarLanger-v2 / results.json
Suprim003's picture
Trained LumarLanger-v2 model with PPO
912181d verified
{"mean_reward": 253.2969367, "std_reward": 19.7052094885131, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2025-07-15T06:18:01.362935"}