Actor-Critic Agent playing Pixelcopter-PLE-v0
This is a trained model of an Actor-Critic agent playing Pixelcopter-PLE-v0.
This model was trained as part of the Deep Reinforcement Learning Course.
Evaluation Results
- Mean Reward: 31.84 +/- 22.63
- Final Score (mean_reward - std_reward): 9.21