Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

GuanOrg
/
DeepRLCourse2022

Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results (legacy)
Model card Files Files and versions
xet
Community

Instructions to use GuanOrg/DeepRLCourse2022 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • stable-baselines3

    How to use GuanOrg/DeepRLCourse2022 with stable-baselines3:

    from huggingface_sb3 import load_from_hub
    checkpoint = load_from_hub(
    	repo_id="GuanOrg/DeepRLCourse2022",
    	filename="{MODEL FILENAME}.zip",
    )
  • Notebooks
  • Google Colab
  • Kaggle
DeepRLCourse2022
1.12 MB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 4 commits
bguan's picture
bguan
bguan's lunar lander model #3 using PPO trained for 1M timesteps
ee17131 about 4 years ago
  • bguan_ppo_lunarlander
    bguan's lunar lander model using PPO trained for 500K timesteps about 4 years ago
  • bguan_ppo_lunarlander2
    bguan's lunar lander model #2 using PPO trained for 500K timesteps about 4 years ago
  • bguan_ppo_lunarlander3
    bguan's lunar lander model #3 using PPO trained for 1M timesteps about 4 years ago
  • .gitattributes
    1.22 kB
    bguan's lunar lander model using PPO trained for 500K timesteps about 4 years ago
  • README.md
    677 Bytes
    bguan's lunar lander model #3 using PPO trained for 1M timesteps about 4 years ago
  • bguan_ppo_lunarlander.zip
    144 kB
    xet
    bguan's lunar lander model using PPO trained for 500K timesteps about 4 years ago
  • bguan_ppo_lunarlander2.zip
    144 kB
    xet
    bguan's lunar lander model #2 using PPO trained for 500K timesteps about 4 years ago
  • bguan_ppo_lunarlander3.zip
    144 kB
    xet
    bguan's lunar lander model #3 using PPO trained for 1M timesteps about 4 years ago
  • config.json
    14.4 kB
    bguan's lunar lander model #3 using PPO trained for 1M timesteps about 4 years ago
  • replay.mp4
    245 kB
    xet
    bguan's lunar lander model #3 using PPO trained for 1M timesteps about 4 years ago
  • results.json
    165 Bytes
    bguan's lunar lander model #3 using PPO trained for 1M timesteps about 4 years ago