RL-lander / a2c-1e6 /policy.optimizer.pth

Commit History

A2C trained on LunarLander-v2 for 1e6 timesteps
9958b53

LinasKo commited on