| --- |
| library_name: stable-baselines3 |
| tags: |
| - CartPole-v1 |
| - deep-reinforcement-learning |
| - reinforcement-learning |
| - stable-baselines3 |
| model-index: |
| - name: PPO |
| results: |
| - task: |
| type: reinforcement-learning |
| name: reinforcement-learning |
| dataset: |
| name: CartPole-v1 |
| type: CartPole-v1 |
| metrics: |
| - type: mean_reward |
| value: 500.00 +/- 0.00 |
| name: mean_reward |
| verified: false |
| --- |
| |
| # **PPO** Agent playing **CartPole-v1** |
| This is a trained model of a **PPO** agent playing **CartPole-v1** |
| using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3). |