REINFORCE Agent playing Pixelcopter-PLE-v0

This repository contains a REINFORCE agent trained for Pixelcopter-PLE-v0 (Hugging Face Deep RL Course, Unit 4).

Evaluation

  • Episodes: 200
  • Max steps/episode: 500
  • Mean reward: 20.05
  • Std reward: 11.38
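
For readers who want to reproduce summary statistics like these from their own rollouts, here is a minimal sketch. It assumes the mean and standard deviation are taken over per-episode total rewards and that the standard deviation is the population form (as NumPy's default `np.std` computes); the card does not state which form was used. The function name `summarize_rewards` and the sample rewards are hypothetical.

```python
import statistics


def summarize_rewards(episode_rewards):
    """Return (mean, population std) of per-episode total rewards."""
    if not episode_rewards:
        raise ValueError("need at least one episode reward")
    mean = statistics.fmean(episode_rewards)
    # Population standard deviation (divides by N); an assumption,
    # since the card does not say how "Std reward" was computed.
    std = statistics.pstdev(episode_rewards)
    return mean, std


# Hypothetical rewards from four episodes, not this model's actual data.
mean, std = summarize_rewards([10.0, 20.0, 30.0, 20.0])
print(f"mean={mean:.2f} std={std:.2f}")
```

With 200 evaluation episodes, the list passed in would hold 200 totals, one per episode.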

Artifacts

  • model_state_dict.pt (PyTorch state_dict)
  • results.json (machine-readable evaluation)
  • replay.mp4 (sample rollout)
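
The artifacts above could be read back roughly as follows. This is a sketch under assumptions: the files are taken to sit in the current directory, the helper names `load_results` and `load_state_dict` are hypothetical, and the key layout of results.json is not documented here, so the code returns the parsed JSON as-is rather than assuming any field names.

```python
import json
from pathlib import Path


def load_results(path="results.json"):
    """Parse the evaluation file and return its contents unchanged."""
    with Path(path).open(encoding="utf-8") as f:
        return json.load(f)


def load_state_dict(path="model_state_dict.pt"):
    """Load the saved PyTorch state_dict onto the CPU."""
    # Imported lazily so the JSON helper works without PyTorch installed.
    import torch

    return torch.load(path, map_location="cpu")
```

The returned state_dict still has to be passed to `load_state_dict` on a policy network whose layers match the one used in training; that architecture is not described in this card.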