liajun committed on
Commit a4e65f9 · verified · 1 Parent(s): 6aea0e8

Update README.md

Files changed (1)
  1. README.md +32 -10
README.md CHANGED
@@ -1,10 +1,32 @@
- # Policy Gradient agent (REINFORCE) for CartPole-v1
-
- This repository contains a simple Policy Gradient (REINFORCE) agent
- implemented in PyTorch and trained on **CartPole-v1**.
-
- Files:
- - `policy_ep2000.pt`: trained model weights (state_dict).
- - `pg_config.yml`: training configuration (YAML).
-
- This code follows Unit 4 of the Hugging Face Deep RL Course.
+ ---
+ tags:
+ - CartPole-v1
+ - reinforce
+ - reinforcement-learning
+ - custom-implementation
+ - deep-rl-class
+ model-index:
+ - name: reinforce-CartPole-v1
+   results:
+   - task:
+       name: reinforcement-learning
+       type: reinforcement-learning
+     dataset:
+       name: CartPole-v1
+       type: CartPole-v1
+     metrics:
+     - name: mean_reward
+       type: mean_reward
+       value: 500.0 +/- 0.0
+ ---
+
+ # Policy Gradient agent (REINFORCE) for CartPole-v1
+
+ This repository contains a simple Policy Gradient (REINFORCE) agent
+ implemented in PyTorch and trained on **CartPole-v1** as part of the
+ Hugging Face Deep Reinforcement Learning Course (Unit 4).
+
+ Files:
+
+ - `policy_ep2000.pt`: trained model weights (state_dict).
+ - `pg_config.yml`: training configuration (YAML).
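
The README names REINFORCE but does not show the update it relies on. As a rough illustration only, the sketch below computes the discounted return for each step of one episode, which REINFORCE uses to weight the log-probability of each action. The function names and the `gamma` default are assumptions for this example; the actual training code and the values in `pg_config.yml` are not shown in this commit.

```python
def discounted_returns(rewards, gamma=0.99):
    """Compute G_t = r_t + gamma * G_{t+1} for every step of one episode.

    `gamma` is an assumed default, not a value taken from pg_config.yml.
    """
    returns = []
    running = 0.0
    # Walk the episode backwards so each return reuses the next one.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def reinforce_loss_terms(log_probs, returns):
    """Per-step REINFORCE loss terms: -log pi(a_t | s_t) * G_t.

    Summing these terms gives the episode loss whose gradient is the
    policy-gradient estimate. Plain floats are used here instead of tensors.
    """
    return [-lp * g for lp, g in zip(log_probs, returns)]


if __name__ == "__main__":
    # Three steps with reward 1 each, as CartPole gives per surviving step.
    print(discounted_returns([1.0, 1.0, 1.0], gamma=0.5))  # [1.75, 1.5, 1.0]
```

In CartPole-v1 every step yields a reward of 1 and episodes are capped at 500 steps, so the reported `mean_reward` of `500.0 +/- 0.0` corresponds to the agent reaching that cap in every evaluation episode.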