Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
LinasKo
/
RL-lander
like
0
Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results (legacy)
Model card
Files
Files and versions
xet
Community
Use this model
main
RL-lander
/
DQN-1e6
/
_stable_baselines3_version
LinasKo
It seems that DQN performs the worst if trained for 1e6 timesteps. But it did train quicker, taking about 17 min, as opposed to 20-22.
3f14e63
about 3 years ago
raw
Copy download link
history
blame
contribute
delete
5 Bytes
1
.
6
.
2