Hopper RL

This model was trained with PPO in the seals/Hopper-v1 environment

Downloads last month
-
Video Preview
loading

Evaluation results