metapat973/Reinforce-Pixelcopter-PLE-v0

REINFORCE Agent on Pixelcopter-PLE-v0

This repository contains a REINFORCE (policy gradient) agent trained on Pixelcopter-PLE-v0.

Evaluation

Mean reward: 48.95 ± 42.79 (self-reported)
Evaluation episodes: 20
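
A figure of this form is usually the mean and standard deviation of the per-episode returns. The sketch below shows one way to compute it, assuming the population standard deviation (the default of NumPy's `np.std`). The helper name `summarize_returns` is hypothetical, and the repository's actual evaluation script may differ.

```python
import statistics


def summarize_returns(episode_returns):
    """Return (mean, population std) of a list of per-episode total rewards.

    Assumption: population standard deviation, not the sample (n - 1) variant.
    """
    mean_reward = statistics.fmean(episode_returns)
    std_reward = statistics.pstdev(episode_returns)
    return mean_reward, std_reward


if __name__ == "__main__":
    # Made-up returns for illustration only; these are not the model's results.
    mean_reward, std_reward = summarize_returns([10.0, 30.0, 50.0])
    print(f"Mean reward: {mean_reward:.2f} ± {std_reward:.2f}")
```

With the reported numbers, the standard deviation is about 0.87 times the mean, so returns vary widely between episodes. An average over 20 episodes is therefore a rough estimate.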

Algorithm

Monte Carlo Policy Gradient
Stochastic policy
PyTorch implementation
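
The three points above can be illustrated with a minimal, framework-free sketch of the quantities REINFORCE works with. It assumes a discount factor `gamma=0.99` and uses hypothetical helper names; it is not the repository's code. In a PyTorch version, the log-probabilities would be tensors produced by the policy network, and the loss would be backpropagated through them.

```python
import math
import random


def discounted_returns(rewards, gamma=0.99):
    """Compute G_t = r_t + gamma * G_{t+1} for each step of one finished episode."""
    returns = []
    running = 0.0
    # Walk backwards so each return reuses the one that follows it.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def sample_action(action_probs, rng=random):
    """Sample an action index from a stochastic policy; return (action, log_prob)."""
    action = rng.choices(range(len(action_probs)), weights=action_probs, k=1)[0]
    return action, math.log(action_probs[action])


def reinforce_loss(log_probs, returns):
    """Surrogate loss -sum_t log pi(a_t | s_t) * G_t.

    Its gradient with respect to the policy parameters is the
    Monte Carlo policy-gradient estimate.
    """
    return -sum(lp * g for lp, g in zip(log_probs, returns))


if __name__ == "__main__":
    # Toy episode with a fixed two-action policy (illustrative values only).
    probs = [0.5, 0.5]
    rewards = [1.0, 1.0, 1.0]
    log_probs = [sample_action(probs)[1] for _ in rewards]
    print(reinforce_loss(log_probs, discounted_returns(rewards)))
```

Because the returns are computed only after an episode ends, each update uses complete Monte Carlo rollouts instead of bootstrapped value estimates. This is one common explanation for high variance in evaluation scores.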
