Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
alanwsx
/
Rl-Unit1-CoLab
like
0
Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results (legacy)
Model card
Files
Files and versions
xet
Community
Use this model
main
Rl-Unit1-CoLab
/
ppo-LunarLander-v2
147 kB
1 contributor
History:
1 commit
alanwsx
First try of Unit 1 CoLab with PPO trained with 1 million steps
33ba93c
verified
about 1 year ago
_stable_baselines3_version
7 Bytes
First try of Unit 1 CoLab with PPO trained with 1 million steps
about 1 year ago
data
14 kB
First try of Unit 1 CoLab with PPO trained with 1 million steps
about 1 year ago
policy.optimizer.pth
88.4 kB
xet
First try of Unit 1 CoLab with PPO trained with 1 million steps
about 1 year ago
policy.pth
43.8 kB
xet
First try of Unit 1 CoLab with PPO trained with 1 million steps
about 1 year ago
pytorch_variables.pth
864 Bytes
xet
First try of Unit 1 CoLab with PPO trained with 1 million steps
about 1 year ago
system_info.txt
263 Bytes
First try of Unit 1 CoLab with PPO trained with 1 million steps
about 1 year ago