| tags: | |
| - ppo | |
| - LunarLander-v2 | |
| - deep-reinforcement-learning | |
| - reinforcement-learning | |
| - custom-implementation | |
| - deep-rl-course | |
| model-index: | |
| - name: PPO-LunarLander-v2 | |
| results: | |
| - task: | |
| type: reinforcement-learning | |
| name: reinforcement-learning | |
| dataset: | |
| name: LunarLander-v2 | |
| type: LunarLander-v2 | |
| metrics: | |
| - type: mean_reward | |
| value: 250.0 # Make sure this represents your actual performance | |
| name: mean_reward | |
| # PPO Agent for LunarLander-v2 | |
| This is a trained model of a PPO agent for LunarLander-v2. | |
| It was trained as part of the [Hugging Face Deep RL Course](https://huggingface.co/learn/deep-rl-course/unit8/introduction). | |