Reinforcement Learning
stable-baselines3
Safetensors
enduro
atari
sb3
ppo
control
Eval Results (legacy)
Instructions to use lucasschott/Enduro-v5-PPO with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- stable-baselines3
How to use lucasschott/Enduro-v5-PPO with stable-baselines3:
from huggingface_sb3 import load_from_hub checkpoint = load_from_hub( repo_id="lucasschott/Enduro-v5-PPO", filename="{MODEL FILENAME}.zip", ) - Notebooks
- Google Colab
- Kaggle
PPO Agent for Enduro-v5
This is a Proximal Policy Optimization (PPO) agent trained on the Enduro-v5 environment using Stable Baselines 3.
Hyperparameters
See config.json for details.
Requirements
- Python: 3.10
Dependencies
gymnasium==1.0.0
ale_py==0.10.1
gymnasium[atari]
torch==2.4.0
stable_baselines3==2.4.1
opencv-python==25.0.1
How to Load
from huggingface_hub import hf_hub_download
from stable_baselines3 import PPO
model_path = hf_hub_download(repo_id="lucasschott/Enduro-v5-PPO", filename="model.zip")
agent = PPO.load(model_path)
- Downloads last month
- 5
Evaluation results
- mean_reward on Enduro-v5self-reported599.54 +/- 131.49