---
license: mit
tags:
- reinforcement-learning
- agent
- ppo
- pytorch
- gymnasium
---

# Reinforcement Learning in Robotic and Simulated Environments

- Environment: Hopper-v5
- Algorithm: PPO
- Steps: 1M
- Average reward: 2200
- Network layer size: 256

Both files must be loaded for the agent to work correctly: the `.pth` file (neural network) and the `.npz` file (observation preprocessing).
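
A minimal sketch of how the two files might be loaded and used together is shown here. It rests on several assumptions that this card does not confirm: the file paths passed in are placeholders, the `.npz` archive is assumed to hold running statistics under the keys `mean` and `var`, and the preprocessing is assumed to be mean/variance standardization followed by clipping. Check `stats.files` and the training code before relying on any of these.

```python
import numpy as np


def normalize_obs(obs, mean, var, eps=1e-8, clip=10.0):
    """Standardize an observation with saved statistics, then clip it.

    Assumption: the preprocessing is (obs - mean) / sqrt(var + eps) with
    clipping; the actual scheme used in training may differ.
    """
    obs = np.asarray(obs, dtype=np.float32)
    return np.clip((obs - mean) / np.sqrt(var + eps), -clip, clip)


def load_agent_files(pth_path, npz_path):
    """Load the network checkpoint and the preprocessing statistics.

    Both paths are supplied by the caller; this card does not name the files.
    """
    # Imported here so normalize_obs can be used without PyTorch installed.
    import torch

    checkpoint = torch.load(pth_path, map_location="cpu")
    with np.load(npz_path) as stats:
        # "mean" and "var" are assumed key names; inspect stats.files to verify.
        mean, var = stats["mean"], stats["var"]
    return checkpoint, mean, var


if __name__ == "__main__":
    # Toy statistics, only to show the call shape of normalize_obs.
    mean = np.zeros(3, dtype=np.float32)
    var = np.ones(3, dtype=np.float32)
    print(normalize_obs([0.5, -20.0, 2.0], mean, var))
```

The same normalization has to be applied to every observation before it is passed to the policy network; feeding raw Hopper-v5 observations to a network trained on preprocessed inputs would likely degrade the reported reward.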