threite/ppo-LunarLander-v2

Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results (legacy)
ppo-LunarLander-v2 / myFirstRLModel
146 kB
  • 1 contributor
History: 1 commit
threite
Trained Model after 1_000_000 timesteps
fd0ebdc about 3 years ago
  • _stable_baselines3_version (5 Bytes)
  • data (14.7 kB)
  • policy.optimizer.pth (87.9 kB)
  • policy.pth (43.2 kB)
  • pytorch_variables.pth (431 Bytes)
  • system_info.txt (184 Bytes)
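
The file names in this folder match the members of the zip archive that Stable-Baselines3 writes when a model is saved, so the folder is probably an unpacked copy of such an archive. Under that assumption, the following is a minimal sketch of how the folder could be packed back into a single .zip file using only the Python standard library. The function name repack_model_dir and the output path are hypothetical and not part of this repository.

```python
import zipfile
from pathlib import Path

# File names as they appear in the listing for myFirstRLModel.
EXPECTED_FILES = [
    "_stable_baselines3_version",
    "data",
    "policy.optimizer.pth",
    "policy.pth",
    "pytorch_variables.pth",
    "system_info.txt",
]


def repack_model_dir(model_dir, archive_path):
    """Zip the listed files from model_dir into archive_path (flat layout).

    Hypothetical helper: it assumes model_dir is an unpacked
    Stable-Baselines3 archive containing exactly the files listed above.
    """
    model_dir = Path(model_dir)
    missing = [name for name in EXPECTED_FILES if not (model_dir / name).is_file()]
    if missing:
        raise FileNotFoundError(f"missing files in {model_dir}: {missing}")

    archive_path = Path(archive_path)
    with zipfile.ZipFile(archive_path, "w", zipfile.ZIP_DEFLATED) as archive:
        for name in EXPECTED_FILES:
            # arcname keeps the members at the top level of the archive.
            archive.write(model_dir / name, arcname=name)
    return archive_path
```

A call such as repack_model_dir("myFirstRLModel", "myFirstRLModel.zip") would produce the archive. Loading the result with PPO.load from stable_baselines3 may then work, depending on the installed library version; that step is not verified here.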