---
tags:
- Pixelcopter-PLE-v0
- reinforce
- reinforcement-learning
- custom-implementation
- deep-rl-course
model-index:
- name: PixelCopter-v1
  results:
  - task:
      type: reinforcement-learning
      name: reinforcement-learning
    dataset:
      name: Pixelcopter-PLE-v0
      type: Pixelcopter-PLE-v0
    metrics:
    - type: mean_reward
      value: 20.045
      name: mean_reward
      verified: false
---
# REINFORCE Agent playing Pixelcopter-PLE-v0

This repository contains a **REINFORCE** agent trained for **Pixelcopter-PLE-v0** (Hugging Face Deep RL Course, Unit 4).

## Evaluation
- Episodes: 200
- Max steps/episode: 500
- Mean reward: 20.05
- Std reward: 11.38
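
As a rough illustration of how figures like these are typically produced, the sketch below aggregates per-episode returns into a mean and standard deviation. It is not the evaluation script used for this repository: `summarize_returns`, `evaluate`, and the `run_episode` callable are hypothetical names, and using the population standard deviation (the NumPy default) is an assumption.

```python
import statistics
from typing import Callable, Dict, Sequence


def summarize_returns(returns: Sequence[float]) -> Dict[str, float]:
    """Aggregate per-episode returns into summary statistics.

    Assumption: population standard deviation (ddof=0) is used.
    """
    if not returns:
        raise ValueError("at least one episode return is required")
    return {
        "n_episodes": len(returns),
        "mean_reward": statistics.fmean(returns),
        "std_reward": statistics.pstdev(returns),
    }


def evaluate(
    run_episode: Callable[[int], float],
    n_episodes: int = 200,
    max_steps: int = 500,
) -> Dict[str, float]:
    """Run `n_episodes` rollouts and summarize their returns.

    `run_episode` is a hypothetical callable: it plays one episode,
    stops after at most `max_steps` steps, and returns the total reward.
    """
    returns = [run_episode(max_steps) for _ in range(n_episodes)]
    return summarize_returns(returns)
```

Passing a function that rolls out the trained policy in the environment as `run_episode` would yield a dictionary with the same three quantities reported above.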

## Artifacts
- `model_state_dict.pt` (PyTorch state_dict)
- `results.json` (machine-readable evaluation)
- `replay.mp4` (sample rollout)
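
To read the evaluation file programmatically, something like the following minimal sketch could work. The schema of `results.json` is not documented here, so the code only assumes that the top level is a JSON object and prints whatever keys it finds; `load_results` is a hypothetical helper name.

```python
import json
from pathlib import Path
from typing import Any, Dict, Union


def load_results(path: Union[str, Path] = "results.json") -> Dict[str, Any]:
    """Load the evaluation file and return its contents as a dict.

    Assumption: the top level of the file is a JSON object.
    """
    with Path(path).open("r", encoding="utf-8") as fh:
        data = json.load(fh)
    if not isinstance(data, dict):
        raise ValueError(f"expected a JSON object in {path}")
    return data


if __name__ == "__main__":
    results_path = Path("results.json")
    # Only print when the file is actually present in the working directory.
    if results_path.exists():
        for key, value in load_results(results_path).items():
            print(f"{key}: {value}")
```

Loading `model_state_dict.pt` additionally requires the policy network's class definition, which is not part of this card.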