Hopper RL

This model was trained with PPO in the seals/Hopper-v1 environment

Downloads last month
2
Video Preview
loading

Evaluation results