---
license: mit
tags:
- reinforcement-learning
- agent
- ppo
- pytorch
- gymnasium
---

# Reinforcement Learning in Robotic and Simulated Environments

- Environment: Hopper-v5
- Algorithm: PPO
- Steps: 1M
- Average reward: 2200
- Network layer size: 256

Both .pth (for neural network) and .npz (for observation preprocessing) files need to be loaded in order to work correctly.
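
A minimal sketch of loading the two files together is shown below. It rests on several assumptions that this card does not confirm: the `.pth` file is taken to hold a `state_dict` for a two-hidden-layer MLP actor with 256 units per layer, the `.npz` file is taken to hold running observation statistics under the hypothetical keys `mean` and `var`, and the `Actor` class, `load_agent`, and `act` are illustrative names. Adjust them to match the actual training code.

```python
import numpy as np
import torch
import torch.nn as nn

# Hopper-v5 has 11 observation dimensions and 3 action dimensions.
# The hidden size of 256 comes from the card; the layout is assumed.
OBS_DIM, ACT_DIM, HIDDEN = 11, 3, 256


class Actor(nn.Module):
    """Hypothetical policy network; the real architecture may differ."""

    def __init__(self, obs_dim=OBS_DIM, act_dim=ACT_DIM, hidden=HIDDEN):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim, hidden), nn.Tanh(),
            nn.Linear(hidden, hidden), nn.Tanh(),
            nn.Linear(hidden, act_dim),
        )

    def forward(self, obs):
        return self.net(obs)


def load_agent(pth_path, npz_path):
    """Load the network weights and the observation statistics."""
    actor = Actor()
    # Assumes the .pth file stores a state_dict, not a pickled module.
    actor.load_state_dict(torch.load(pth_path, map_location="cpu"))
    actor.eval()
    # Assumes the .npz file stores arrays named "mean" and "var".
    with np.load(npz_path) as stats:
        mean, var = stats["mean"], stats["var"]
    return actor, mean, var


def act(actor, mean, var, obs, eps=1e-8):
    """Normalize a raw observation and return the mean action."""
    norm = (np.asarray(obs, dtype=np.float32) - mean) / np.sqrt(var + eps)
    with torch.no_grad():
        action = actor(torch.as_tensor(norm, dtype=torch.float32))
    # Hopper-v5 actions are bounded to [-1, 1].
    return np.clip(action.numpy(), -1.0, 1.0)
```

Under these assumptions, each observation returned by `gymnasium.make("Hopper-v5")` would be passed through `act` before calling `env.step`. Skipping the normalization step feeds the network inputs on a different scale from the one it was trained on, which is why both files are needed.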