Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

mgfrantz
/
ppo-LunarLander-v2

Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results (legacy)
Model card Files Files and versions
xet
Community
ppo-LunarLander-v2 / mikes_first_lander-1
143 kB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 12 commits
mgfrantz's picture
mgfrantz
Trained LunarLander-v2-PPO-0 with a reduced learning rate by a factor of 10
a2102ab almost 4 years ago
  • _stable_baselines3_version
    5 Bytes
    This LunarLander-v2-PPO-0 is my first submission to the HF Hub! almost 4 years ago
  • data
    14.6 kB
    Trained LunarLander-v2-PPO-0 with a reduced learning rate by a factor of 10 almost 4 years ago
  • policy.optimizer.pth

    Detected Pickle imports (3)

    • "torch.FloatStorage",
    • "torch._utils._rebuild_tensor_v2",
    • "collections.OrderedDict"

    What is a pickle import?

    84.9 kB
    xet
    Trained LunarLander-v2-PPO-0 with a reduced learning rate by a factor of 10 almost 4 years ago
  • policy.pth

    Detected Pickle imports (3)

    • "torch._utils._rebuild_tensor_v2",
    • "collections.OrderedDict",
    • "torch.FloatStorage"

    What is a pickle import?

    43.2 kB
    xet
    Trained LunarLander-v2-PPO-0 with a reduced learning rate by a factor of 10 almost 4 years ago
  • pytorch_variables.pth

    Pickle imports

    • No problematic imports detected

    What is a pickle import?

    431 Bytes
    xet
    This LunarLander-v2-PPO-0 is my first submission to the HF Hub! almost 4 years ago
  • system_info.txt
    193 Bytes
    This LunarLander-v2-PPO-0 is my first submission to the HF Hub! almost 4 years ago