Youtaaa commited on
Commit
42b44e2
·
verified ·
1 Parent(s): dd037e0

Upload folder using huggingface_hub

Browse files
Files changed (4) hide show
  1. README.md +27 -3
  2. hyperparameters.json +11 -1
  3. model.pt +2 -2
  4. results.json +1 -1
README.md CHANGED
@@ -16,9 +16,33 @@ model-index:
16
  type: Pixelcopter-PLE-v0
17
  metrics:
18
  - type: mean_reward
19
- value: 8.50 +/- 10.00
20
  name: mean_reward
21
  verified: false
22
  ---
23
- # Reinforce Agent playing Pixelcopter-PLE-v0
24
- Unit 4 Deep RL Course.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
16
  type: Pixelcopter-PLE-v0
17
  metrics:
18
  - type: mean_reward
19
+ value: 27.50 +/- 21.88
20
  name: mean_reward
21
  verified: false
22
  ---
23
+
24
+ # REINFORCE Agent - PixelCopter-PLE-v0
25
+
26
+ Trained with the REINFORCE algorithm.
27
+
28
+ ## Results
29
+
30
+ | Mean reward | Std reward |
31
+ |-------------|------------|
32
+ | 27.50 | 21.88 |
33
+
34
+ ## Hyperparameters
35
+
36
+ ```json
37
+ {
38
+ "h_size": 64,
39
+ "n_training_episodes": 20000,
40
+ "n_evaluation_episodes": 10,
41
+ "max_t": 10000,
42
+ "gamma": 0.99,
43
+ "lr": 0.0001,
44
+ "env_id": "Pixelcopter-PLE-v0",
45
+ "state_space": 7,
46
+ "action_space": 2
47
+ }
48
+ ```
hyperparameters.json CHANGED
@@ -1 +1,11 @@
1
- {"h_size": 64, "n_training_episodes": 20000, "n_evaluation_episodes": 10, "max_t": 10000, "gamma": 0.99, "lr": 0.0001, "env_id": "Pixelcopter-PLE-v0", "state_space": 7, "action_space": 2}
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "h_size": 64,
3
+ "n_training_episodes": 20000,
4
+ "n_evaluation_episodes": 10,
5
+ "max_t": 10000,
6
+ "gamma": 0.99,
7
+ "lr": 0.0001,
8
+ "env_id": "Pixelcopter-PLE-v0",
9
+ "state_space": 7,
10
+ "action_space": 2
11
+ }
model.pt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2ee1b7ef565bb6b4aee611a5581b7db96a6486ba13ad522ee68ae7aed23a88e6
3
- size 40125
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b868f80f3eb62650786bac19b5fc26ca7e6efc8cdc414b17edb603f4e699ef86
3
+ size 40189
results.json CHANGED
@@ -1 +1 @@
1
- {"env_id": "Pixelcopter-PLE-v0", "mean_reward": 8.5, "n_evaluation_episodes": 10, "eval_datetime": "2026-03-18T22:14:44.443965"}
 
1
+ {"env_id": "Pixelcopter-PLE-v0", "mean_reward": 27.5, "std_reward": 21.88}