| tags: | |
| - Pixelcopter-PLE-v0 | |
| - reinforce | |
| - reinforcement-learning | |
| - custom-implementation | |
| - deep-rl-course | |
| model-index: | |
| - name: PixelCopter-v1 | |
| results: | |
| - task: | |
| type: reinforcement-learning | |
| name: reinforcement-learning | |
| dataset: | |
| name: Pixelcopter-PLE-v0 | |
| type: Pixelcopter-PLE-v0 | |
| metrics: | |
| - type: mean_reward | |
| value: 20.045 | |
| name: mean_reward | |
| verified: false | |
| # REINFORCE Agent playing Pixelcopter-PLE-v0 | |
| This repository contains a **REINFORCE** agent trained for **Pixelcopter-PLE-v0** (Hugging Face Deep RL Course, Unit 4). | |
| ## Evaluation | |
| - Episodes: 200 | |
| - Max steps/episode: 500 | |
| - Mean reward: 20.05 | |
| - Std reward: 11.38 | |
| Artifacts: | |
| - `model_state_dict.pt` (PyTorch state_dict) | |
| - `results.json` (machine-readable evaluation) | |
| - `replay.mp4` (sample rollout) | |