RL-lander / README.md

Commit History

DQN seems to perform the worst when trained for 1e6 timesteps. It did train quicker, though, taking about 17 min as opposed to 20-22 min.
3f14e63

LinasKo committed on

A2C trained on LunarLander-v2 for 1e6 timesteps
9958b53

LinasKo committed on

Trained on my local machine.
1a41a8f

LinasKo committed on

The initial run of the lander, after training for 1M timesteps.
461dce9

LinasKo committed on