It seems that DQN performs the worst if trained for 1e6 timesteps. But it did train quicker, taking about 17 min, as opposed to 20-22. 3f14e63 LinasKo commited on Dec 15, 2022
The initial run of the lander, after training for 1M timestamps. 461dce9 LinasKo commited on Dec 14, 2022