Reinforce Agent playing Pixelcopter-PLE-v0

This is a trained model of a REINFORCE agent playing Pixelcopter-PLE-v0.

Model Details

  • Algorithm: REINFORCE (Monte Carlo Policy Gradient)
  • Environment: Pixelcopter-PLE-v0
  • Mean Reward: 16.20
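
As a rough illustration of the Monte Carlo policy gradient idea named above, the sketch below computes discounted returns for one finished episode and the corresponding REINFORCE surrogate loss. It is a minimal pure-Python sketch, not this model's training code: the function names, the discount factor of 0.99, and the use of plain floats in place of framework tensors are all assumptions made for illustration.

```python
from typing import List, Sequence


def discounted_returns(rewards: Sequence[float], gamma: float = 0.99) -> List[float]:
    """Compute G_t = r_t + gamma * G_{t+1} for every step of one episode.

    gamma=0.99 is an assumed default, not a value taken from this model card.
    """
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each return reuses the one that follows it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def reinforce_loss(log_probs: Sequence[float], returns: Sequence[float]) -> float:
    """Surrogate loss -sum_t log pi(a_t | s_t) * G_t for one episode.

    Minimizing this value raises the log-probability of actions that were
    followed by high returns.
    """
    return -sum(lp * g for lp, g in zip(log_probs, returns))


if __name__ == "__main__":
    episode_rewards = [1.0, 1.0, 1.0]
    print(discounted_returns(episode_rewards, gamma=0.5))
```

In an actual training loop the log-probabilities would be tensors from an automatic-differentiation framework, so that the gradient of this loss can flow back into the policy network.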

To learn how to use this model and train your own, see Unit 4 of the Deep Reinforcement Learning Course: https://huggingface.co/deep-rl-course/unit4/introduction
