REINFORCE Agent playing Pixelcopter-PLE-v0
This repository contains a REINFORCE agent trained for Pixelcopter-PLE-v0 (Hugging Face Deep RL Course, Unit 4).
Evaluation
- Episodes: 200
- Max steps/episode: 500
- Mean reward: 20.05
- Std reward: 11.38
Artifacts:
model_state_dict.pt(PyTorch state_dict)results.json(machine-readable evaluation)replay.mp4(sample rollout)
Evaluation results
- mean_reward on Pixelcopter-PLE-v0self-reported20.045