LinasKo
/

RL-lander

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning

Eval Results (legacy)

Model card Files Files and versions

RL-lander / README.md

Commit History

It seems that DQN performs the worst if trained for 1e6 timesteps. But it did train quicker, taking about 17 min, as opposed to 20-22.

3f14e63

LinasKo commited on Dec 15, 2022

A2C trained on LunarLander-v2 for 1e6 timesteps

9958b53

LinasKo commited on Dec 15, 2022

Trained on my local machine.

1a41a8f

LinasKo commited on Dec 15, 2022

The initial run of the lander, after training for 1M timestamps.

461dce9

LinasKo commited on Dec 14, 2022