forgedRice
/

Reinforce-Pixelcopter-PLE-v0

Reinforcement Learning

Pixelcopter-PLE-v0

custom-implementation

Eval Results (legacy)

Model card Files Files and versions

Reinforce Agent playing Pixelcopter-PLE-v0

This is a trained model of a Reinforce agent playing Pixelcopter-PLE-v0.

Model Details

Algorithm: REINFORCE (Monte Carlo Policy Gradient)
Environment: Pixelcopter-PLE-v0
Mean Reward: 16.20

To learn to use this model and train yours check Unit 4 of the Deep Reinforcement Learning Course: https://huggingface.co/deep-rl-course/unit4/introduction

Downloads last month: -; Downloads are not tracked for this model. How to track

Video Preview

Reinforcement Learning

loading

Evaluation results

mean_reward on Pixelcopter-PLE-v0
self-reported

16.20