Policy Gradient agent (REINFORCE) for CartPole-v1

This repository contains a simple Policy Gradient (REINFORCE) agent implemented in PyTorch and trained on CartPole-v1 as part of the Hugging Face Deep Reinforcement Learning Course (Unit 4).
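As a reminder of what REINFORCE optimizes, the sketch below computes the discounted return G_t for each step of one episode; the policy loss is then the negative sum of log pi(a_t | s_t) * G_t over the episode. The function name `discounted_returns` and the default `gamma=0.99` are illustrative assumptions, not values taken from this repository; the discount actually used for training would be in pg_config.yml.

```python
def discounted_returns(rewards, gamma=0.99):
    """Return [G_0, G_1, ...] where G_t = r_t + gamma * G_{t+1}.

    `gamma` is an assumed default; check pg_config.yml for the trained value.
    """
    returns = []
    running = 0.0
    # Walk the episode backwards so each step reuses the return of the next one.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


if __name__ == "__main__":
    # CartPole-v1 gives a reward of 1 for every step the pole stays up.
    print(discounted_returns([1.0, 1.0, 1.0], gamma=0.5))
```

Computing the returns backwards keeps the cost linear in the episode length, instead of re-summing the tail of the episode at every step.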

Files:

  • policy_ep2000.pt: trained model weights (state_dict).
  • pg_config.yml: training configuration (YAML).
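A minimal sketch of how the weights might be loaded for inference. The network definition is not included in this listing, so the `Policy` class below (one hidden layer, `hidden_size=16`) and the helper `load_policy` are assumptions made for illustration. The layer names and sizes must match whatever architecture produced policy_ep2000.pt, otherwise `load_state_dict` will raise an error. Only the input and output sizes (4 observations, 2 actions) follow from CartPole-v1 itself.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
from torch.distributions import Categorical


class Policy(nn.Module):
    """Hypothetical policy network; adjust to match the saved checkpoint."""

    def __init__(self, state_size=4, action_size=2, hidden_size=16):
        super().__init__()
        self.fc1 = nn.Linear(state_size, hidden_size)
        self.fc2 = nn.Linear(hidden_size, action_size)

    def forward(self, x):
        x = F.relu(self.fc1(x))
        # Softmax over the action dimension gives action probabilities.
        return F.softmax(self.fc2(x), dim=1)

    def act(self, state):
        # Add a batch dimension, then sample an action from the policy.
        state = torch.as_tensor(state, dtype=torch.float32).unsqueeze(0)
        dist = Categorical(self.forward(state))
        action = dist.sample()
        return action.item(), dist.log_prob(action)


def load_policy(weights_path, **policy_kwargs):
    """Build a Policy and load a state_dict saved with torch.save."""
    policy = Policy(**policy_kwargs)
    state_dict = torch.load(weights_path, map_location="cpu")
    policy.load_state_dict(state_dict)
    policy.eval()  # inference mode
    return policy
```

With a matching architecture, `policy = load_policy("policy_ep2000.pt")` followed by `action, _ = policy.act(observation)` would select an action for a CartPole-v1 observation. The configuration file can be read with `yaml.safe_load` from PyYAML; its keys are not documented here, so inspect the file before relying on specific names.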