metadata
tags:
- Pixelcopter-PLE-v0
- reinforce
- reinforcement-learning
- custom-implementation
- deep-rl-course
model-index:
- name: PixelCopter-v1
results:
- task:
type: reinforcement-learning
name: reinforcement-learning
dataset:
name: Pixelcopter-PLE-v0
type: Pixelcopter-PLE-v0
metrics:
- type: mean_reward
value: 20.045
name: mean_reward
verified: false
REINFORCE Agent playing Pixelcopter-PLE-v0
This repository contains a REINFORCE agent trained for Pixelcopter-PLE-v0 (Hugging Face Deep RL Course, Unit 4).
Evaluation
- Episodes: 200
- Max steps/episode: 500
- Mean reward: 20.05
- Std reward: 11.38
Artifacts:
model_state_dict.pt(PyTorch state_dict)results.json(machine-readable evaluation)replay.mp4(sample rollout)