Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
LinasKo
/
RL-lander
like
0
Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results (legacy)
Model card
Files
Files and versions
xet
Community
Use this model
main
RL-lander
/
lander-MlpPolicy-1
146 kB
1 contributor
History:
1 commit
LinasKo
The initial run of the lander, after training for 1M timestamps.
461dce9
about 3 years ago
_stable_baselines3_version
Safe
5 Bytes
The initial run of the lander, after training for 1M timestamps.
about 3 years ago
data
Safe
14.7 kB
The initial run of the lander, after training for 1M timestamps.
about 3 years ago
policy.optimizer.pth
87.9 kB
xet
The initial run of the lander, after training for 1M timestamps.
about 3 years ago
policy.pth
43.2 kB
xet
The initial run of the lander, after training for 1M timestamps.
about 3 years ago
pytorch_variables.pth
Safe
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
431 Bytes
xet
The initial run of the lander, after training for 1M timestamps.
about 3 years ago
system_info.txt
Safe
184 Bytes
The initial run of the lander, after training for 1M timestamps.
about 3 years ago