PPO Agent playing LunarLander-v3
This is a trained model of a PPO agent playing LunarLander-v3 using the stable-baselines3 library.
Use of Deep RL library Stable Baselines3
Model description
A Lunar Lander agent, train to learn to adapt its peed and position to land on the moon.
from stable_baselines3 import ...
from huggingface_sb3 import load_from_hub
...
- Downloads last month
- 7
Evaluation results
- mean_reward on LunarLander-v3self-reported250.71 +/- 18.42