LinasKo
/
RL-lander
Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results (legacy)
main
RL-lander
1.24 MB
1 contributor
History:
5 commits
LinasKo
It seems that DQN performs the worst if trained for 1e6 timesteps. But it did train quicker, taking about 17 min, as opposed to 20-22.
3f14e63
over 3 years ago
DQN-1e6
It seems that DQN performs the worst if trained for 1e6 timesteps. But it did train quicker, taking about 17 min, as opposed to 20-22.
over 3 years ago
a2c-1e6
A2C trained on LunarLander-v2 for 1e6 timesteps
over 3 years ago
lander-MlpPolicy-1
The initial run of the lander, after training for 1M timesteps.
over 3 years ago
ppo-1e6
Trained on my local machine.
over 3 years ago
.gitattributes
1.48 kB
initial commit
over 3 years ago
DQN-1e6.zip
110 kB
It seems that DQN performs the worst if trained for 1e6 timesteps. But it did train quicker, taking about 17 min, as opposed to 20-22.
over 3 years ago
README.md
784 Bytes
It seems that DQN performs the worst if trained for 1e6 timesteps. But it did train quicker, taking about 17 min, as opposed to 20-22.
over 3 years ago
a2c-1e6.zip
102 kB
A2C trained on LunarLander-v2 for 1e6 timesteps
over 3 years ago
config.json
19.4 kB
It seems that DQN performs the worst if trained for 1e6 timesteps. But it did train quicker, taking about 17 min, as opposed to 20-22.
over 3 years ago
lander-MlpPolicy-1.zip
147 kB
The initial run of the lander, after training for 1M timesteps.
over 3 years ago
ppo-1e6.zip
148 kB
Trained on my local machine.
over 3 years ago
replay.mp4
207 kB
It seems that DQN performs the worst if trained for 1e6 timesteps. But it did train quicker, taking about 17 min, as opposed to 20-22.
over 3 years ago
results.json
165 Bytes
It seems that DQN performs the worst if trained for 1e6 timesteps. But it did train quicker, taking about 17 min, as opposed to 20-22.
over 3 years ago
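The checkpoints listed above could be loaded roughly as follows. This is a minimal sketch, not the author's workflow: it assumes the `stable-baselines3` and `huggingface_sb3` packages are installed, and the helper names `algo_name_for` and `load_checkpoint` are hypothetical. The algorithm for each file is inferred from its file name only; `lander-MlpPolicy-1.zip` is left out because the listing does not say which algorithm produced it.

```python
REPO_ID = "LinasKo/RL-lander"  # repository shown in this listing

# Checkpoints whose algorithm is evident from the file name (an inference,
# not something the repository states explicitly).
CHECKPOINT_ALGOS = {
    "DQN-1e6.zip": "DQN",
    "a2c-1e6.zip": "A2C",
    "ppo-1e6.zip": "PPO",
}


def algo_name_for(filename: str) -> str:
    """Return the stable-baselines3 class name for a known checkpoint file."""
    try:
        return CHECKPOINT_ALGOS[filename]
    except KeyError:
        raise ValueError(
            f"algorithm for {filename!r} is not stated in the listing"
        ) from None


def load_checkpoint(filename: str):
    """Download a checkpoint from the Hub and load it with the matching class."""
    # Imported lazily so the mapping above works without these packages.
    import stable_baselines3 as sb3
    from huggingface_sb3 import load_from_hub

    algo_cls = getattr(sb3, algo_name_for(filename))
    local_path = load_from_hub(repo_id=REPO_ID, filename=filename)
    return algo_cls.load(local_path)
```

A loaded model could then be scored with `evaluate_policy` from `stable_baselines3.common.evaluation` on a LunarLander-v2 environment. Checkpoints of this age may only load cleanly with library versions close to those used for training.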