Reinforce Agent playing Pixelcopter-PLE-v0

Trained with REINFORCE policy gradient.

Downloads last month

-

Downloads are not tracked for this model. How to track
Video Preview
loading

Evaluation results