Actor-Critic Agent playing Pixelcopter-PLE-v0
This is a trained model of an Actor-Critic agent playing Pixelcopter-PLE-v0.
This model was trained as part of the Deep Reinforcement Learning Course.
Evaluation Results
- Mean Reward: 31.20 +/- 28.74
- Final Score (mean_reward - std_reward): 2.46