liajun/reinforce-CartPole-v1


liajun committed on Nov 24, 2025

Commit a4e65f9 (verified) · 1 Parent(s): 6aea0e8

Update README.md

Files changed (1)

README.md +32 -10

@@ -1,10 +1,32 @@
-# Policy Gradient agent (REINFORCE) for CartPole-v1
-This repository contains a simple Policy Gradient (REINFORCE) agent
-implemented in PyTorch and trained on **CartPole-v1**.
-Files:
-- `policy_ep2000.pt`: trained model weights (state_dict).
-- `pg_config.yml`: training configuration (YAML).
-This code follows Unit 4 of the Hugging Face Deep RL Course.

+---
+tags:
+- CartPole-v1
+- reinforce
+- reinforcement-learning
+- custom-implementation
+- deep-rl-class
+model-index:
+- name: reinforce-CartPole-v1
+  results:
+  - task:
+      name: reinforcement-learning
+      type: reinforcement-learning
+    dataset:
+      name: CartPole-v1
+      type: CartPole-v1
+    metrics:
+    - name: mean_reward
+      type: mean_reward
+      value: 500.0 +/- 0.0
+---
+# Policy Gradient agent (REINFORCE) for CartPole-v1
+This repository contains a simple Policy Gradient (REINFORCE) agent
+implemented in PyTorch and trained on **CartPole-v1** as part of the
+Hugging Face Deep Reinforcement Learning Course (Unit 4).
+Files:
+- `policy_ep2000.pt`: trained model weights (state_dict).
+- `pg_config.yml`: training configuration (YAML).
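
Reviewer note, not part of the commit: the README names REINFORCE but does not show its core computation. The sketch below illustrates the discounted return G_t = r_t + gamma * G_{t+1} that REINFORCE uses to weight each action's log-probability. It is a minimal illustration in plain Python, not the repository's code: the function name `discounted_returns` and the default `gamma=0.99` are assumptions, and the actual discount factor is presumably set in `pg_config.yml`.

```python
from typing import List


def discounted_returns(rewards: List[float], gamma: float = 0.99) -> List[float]:
    """Return G_t = r_t + gamma * G_{t+1} for every step t of one episode.

    Hypothetical helper for illustration; gamma=0.99 is an assumed default.
    """
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk the episode backwards so each step reuses the return of the next step.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


if __name__ == "__main__":
    # CartPole-v1 gives a reward of 1.0 per step; three steps shown for brevity.
    print(discounted_returns([1.0, 1.0, 1.0], gamma=0.5))  # [1.75, 1.5, 1.0]
```

In a full training loop these returns would typically be normalized and multiplied by the negative log-probabilities of the sampled actions to form the policy-gradient loss.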