mblond commited on
Commit
46bddff
·
verified ·
1 Parent(s): e7b6eb7

Upload folder using huggingface_hub

Browse files
Files changed (4) hide show
  1. README.md +5 -9
  2. hyperparameters.json +1 -1
  3. model.pth +1 -1
  4. replay.mp4 +0 -0
README.md CHANGED
@@ -1,12 +1,11 @@
1
  ---
2
  tags:
3
- - Pixelcopter-PLE-v0
4
  - actor-critic
5
  - reinforcement-learning
6
- - custom-implementation
7
  - deep-rl-class
 
8
  model-index:
9
- - name: ActorCritic-Pixelcopter-PLE-v0
10
  results:
11
  - task:
12
  type: reinforcement-learning
@@ -16,17 +15,14 @@ model-index:
16
  type: Pixelcopter-PLE-v0
17
  metrics:
18
  - type: mean_reward
19
- value: 41.45 +/- 39.02
20
  name: mean_reward
21
  verified: false
22
  ---
23
 
24
  # **Actor-Critic** Agent playing **Pixelcopter-PLE-v0**
25
-
26
  This is a trained model of an **Actor-Critic** agent playing **Pixelcopter-PLE-v0**.
27
  This model was trained as part of the Deep Reinforcement Learning Course.
28
-
29
  ## Evaluation Results
30
-
31
- - **Mean Reward:** 41.45 +/- 39.02
32
- - **Final Score (mean_reward - std_reward):** 2.43
 
1
  ---
2
  tags:
 
3
  - actor-critic
4
  - reinforcement-learning
 
5
  - deep-rl-class
6
+ - Pixelcopter-PLE-v0
7
  model-index:
8
+ - name: ActorCritic-Pixelcopter
9
  results:
10
  - task:
11
  type: reinforcement-learning
 
15
  type: Pixelcopter-PLE-v0
16
  metrics:
17
  - type: mean_reward
18
+ value: 31.84 +/- 22.63
19
  name: mean_reward
20
  verified: false
21
  ---
22
 
23
  # **Actor-Critic** Agent playing **Pixelcopter-PLE-v0**
 
24
  This is a trained model of an **Actor-Critic** agent playing **Pixelcopter-PLE-v0**.
25
  This model was trained as part of the Deep Reinforcement Learning Course.
 
26
  ## Evaluation Results
27
+ - **Mean Reward:** 31.84 +/- 22.63
28
+ - **Final Score (mean_reward - std_reward):** 9.21
 
hyperparameters.json CHANGED
@@ -1 +1 @@
1
- {"h_size": 256, "n_training_episodes": 20000, "n_evaluation_episodes": 20, "max_t": 1000, "gamma": 0.99, "lr": 0.0001, "env_id": "Pixelcopter-PLE-v0"}
 
1
+ {"h_size": 256, "n_training_episodes": 20000, "n_evaluation_episodes": 25, "max_t": 1000, "gamma": 0.99, "lr": 0.0001, "env_id": "Pixelcopter-PLE-v0"}
model.pth CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:102bf27677b81c5ab63a21c0187ae9d88e28de7ee7907292477b9698910a7b99
3
  size 277224
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fc7cd031726ed5f9f5e6433fe1ddb47545dfc11089fec9a679d9d07e54dacd84
3
  size 277224
replay.mp4 CHANGED
Binary files a/replay.mp4 and b/replay.mp4 differ