Battu007/V4_PPO2_LunarLander_v2

Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results (legacy)
V4_PPO2_LunarLander_v2 / V4_PPO_LL
146 kB
  • 2 contributors
History: 1 commit
ASBattu
PPO Hyperparemeter tune 1M steps LL-2 agent
825d67c over 3 years ago
  • _stable_baselines3_version
    5 Bytes
    PPO Hyperparemeter tune 1M steps LL-2 agent over 3 years ago
  • data
    17.5 kB
    PPO Hyperparemeter tune 1M steps LL-2 agent over 3 years ago
  • policy.optimizer.pth

    Detected Pickle imports (3)

    • "torch._utils._rebuild_tensor_v2",
    • "collections.OrderedDict",
    • "torch.FloatStorage"

    84.6 kB
    PPO Hyperparemeter tune 1M steps LL-2 agent over 3 years ago
  • policy.pth

    Detected Pickle imports (3)

    • "torch._utils._rebuild_tensor_v2",
    • "collections.OrderedDict",
    • "torch.FloatStorage"

    43.1 kB
    PPO Hyperparemeter tune 1M steps LL-2 agent over 3 years ago
  • pytorch_variables.pth

    Pickle imports

    • No problematic imports detected

    431 Bytes
    PPO Hyperparemeter tune 1M steps LL-2 agent over 3 years ago
  • system_info.txt
    146 Bytes
    PPO Hyperparemeter tune 1M steps LL-2 agent over 3 years ago
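
The "Detected Pickle imports" entries above name the globals that a pickle stream would import when loaded. As a rough illustration of how such a list can be produced, here is a minimal sketch using only Python's standard `pickletools` module to read the opcodes without unpickling anything. This is not the scanner the Hub uses; the function name `list_pickle_imports` is hypothetical, and the sketch assumes you already have the raw pickle bytes in hand (extracting them from a `.pth` file is not shown).

```python
import collections
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> set[str]:
    """Return the 'module.name' globals referenced by a pickle stream.

    The stream is only parsed opcode by opcode; nothing is unpickled.
    """
    imports = set()
    strings = []  # string arguments seen so far, used for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Older protocols: genops yields "module name" as one string.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name are pushed as strings first.
            # Simplification: assumes both were pushed literally, not
            # fetched back from the memo.
            if len(strings) >= 2:
                imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return imports


# Hand-written protocol-0 pickle that references collections.OrderedDict.
print(sorted(list_pickle_imports(b"ccollections\nOrderedDict\n.")))
# → ['collections.OrderedDict']

# The same check on a freshly pickled object.
print(sorted(list_pickle_imports(pickle.dumps(collections.OrderedDict()))))
```

A listing like the one for pytorch_variables.pth ("No problematic imports detected") would correspond to a stream whose referenced globals, if any, are all considered safe.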