REINFORCE agent playing CartPole-v1

Trained with a custom PyTorch REINFORCE implementation.
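The card does not describe the implementation, so the following is only a minimal sketch of the two quantities a REINFORCE update is typically built on: the discounted return of each time step and the policy-gradient surrogate loss. The function names, the `gamma=0.99` default, and the use of plain Python lists in place of PyTorch tensors are assumptions made for illustration, not details of the author's code.

```python
def discounted_returns(rewards, gamma=0.99):
    """Return G_t = r_t + gamma * r_{t+1} + gamma^2 * r_{t+2} + ... for every step t.

    `gamma` is an assumed discount factor; the model card does not state one.
    """
    returns = []
    running = 0.0
    # Walk the episode backwards so each return reuses the one after it.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def reinforce_loss(log_probs, returns):
    """Surrogate loss -sum_t log pi(a_t | s_t) * G_t.

    In a PyTorch implementation `log_probs` would be tensors attached to the
    policy network's graph; plain floats are used here to keep the sketch
    dependency-free.
    """
    return sum(-lp * g for lp, g in zip(log_probs, returns))


if __name__ == "__main__":
    episode_rewards = [1.0, 1.0, 1.0]  # CartPole gives +1 per surviving step
    print(discounted_returns(episode_rewards, gamma=0.5))
```

Minimizing this loss with a gradient-based optimizer raises the log-probability of actions that were followed by high returns. Many implementations also normalize the returns before computing the loss, which is a common variance-reduction choice and not something the card confirms.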

Evaluation

  • Mean reward: 489.70
  • Std reward: 39.39
  • Score (mean - std): 450.31
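The score above is the mean reward minus its standard deviation, which penalizes agents whose episode returns vary widely. A small sketch of that arithmetic follows; the helper names are hypothetical, and the use of the population standard deviation is an assumption, since the card does not say which estimator was used.

```python
import statistics


def score(mean_reward, std_reward):
    """Lower-bound style score: mean reward minus one standard deviation."""
    return mean_reward - std_reward


def summarize(episode_rewards):
    """Compute (mean, std, score) from a list of per-episode total rewards.

    Uses the population standard deviation (statistics.pstdev); this is an
    assumption about the evaluation, not something stated in the card.
    """
    mean_reward = statistics.fmean(episode_rewards)
    std_reward = statistics.pstdev(episode_rewards)
    return mean_reward, std_reward, score(mean_reward, std_reward)


if __name__ == "__main__":
    # Reproduce the reported score from the reported mean and std.
    print(round(score(489.70, 39.39), 2))
```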