sam133/ppo-LunarLander-v2
Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results (legacy)
main, 493 kB, 1 contributor, History: 2 commits
sam133, commit f923604, about 3 years ago: Model trained for 10 million timesteps with mean_reward=286.17
Name | Size | Last commit | Updated
ppo-LunarLander-v2-T2 | - | Model trained for 10 million timesteps with mean_reward=286.17 | about 3 years ago
.gitattributes | 1.48 kB | initial commit | about 3 years ago
README.md | 784 Bytes | Model trained for 10 million timesteps with mean_reward=286.17 | about 3 years ago
config.json | 14.4 kB | Model trained for 10 million timesteps with mean_reward=286.17 | about 3 years ago
ppo-LunarLander-v2-T2.zip | 147 kB (xet) | Model trained for 10 million timesteps with mean_reward=286.17 | about 3 years ago
replay.mp4 | 183 kB | Model trained for 10 million timesteps with mean_reward=286.17 | about 3 years ago
results.json | 165 Bytes | Model trained for 10 million timesteps with mean_reward=286.17 | about 3 years ago