---
license: mit
tags:
- reinforcement-learning
- agent
- ppo
- pytorch
- gymnasium
---

# Reinforcement Learning in Robotic and Simulated Environments
- Environment: Hopper-v5
- Algorithm: PPO
- Steps: 1M
- Average reward: 2200
- Network layer size: 256

Both .pth (for neural network) and .npz (for observation preprocessing) files need to be loaded in order to work correctly.