PPO Agent for Enduro-v5

This is a Proximal Policy Optimization (PPO) agent trained on the Enduro-v5 environment using Stable Baselines 3.

Hyperparameters

See config.json for details.

Requirements

Python: 3.10

Dependencies

gymnasium==1.0.0
ale_py==0.10.1
gymnasium[atari]
torch==2.4.0
stable_baselines3==2.4.1
opencv-python==25.0.1

How to Load

from huggingface_hub import hf_hub_download
from stable_baselines3 import PPO

model_path = hf_hub_download(repo_id="lucasschott/Enduro-v5-PPO", filename="model.zip")
agent = PPO.load(model_path)

Downloads last month: 2

Safetensors

Model size

2.24M params

Tensor type

F32

Video Preview

Reinforcement Learning

Evaluation results

mean_reward on Enduro-v5
self-reported

599.54 +/- 131.49