PPO LunarLander-v2 trained agent - batchsize 32, total_timestaps 4M 03cb757 verified polyconnect commited on May 19, 2024
PPO LunarLander-v2 trained agent - batchsize 32, total_timestaps 4M 72b7312 verified polyconnect commited on May 19, 2024