Reinforce Agent playing CartPole-v1 This is a trained model of a Reinforce agent playing CartPole-v1. Trained for 2000 episodes.
Reference:
- Unit 4 of the Deep Reinforcement Learning Course: https://huggingface.co/deep-rl-course/unit4/introduction
Evaluation results
- mean_reward on CartPole-v1self-reported170.30 +/- 9.72