RL-lander / DQN-1e6 /_stable_baselines3_version
It seems that DQN performs the worst when trained for 1e6 timesteps. It did train faster, though, taking about 17 min as opposed to 20-22 min.
3f14e63
1.6.2