Update README.md
Browse files
README.md
CHANGED
|
@@ -24,7 +24,7 @@ model-index:
|
|
| 24 |
|
| 25 |
# Actor-Critic Agent Playing PandaReachDense-v3
|
| 26 |
|
| 27 |
-
This is a trained model of an
|
| 28 |
|
| 29 |
# Hyperparameters
|
| 30 |
hp_seed: 2444<br />hp_torch_deterministic: True<br />hp_total_timesteps: 20500<br />hp_critic_nstep: 1<br />hp_num_envs: 12<br />hp_learning_rate_actor: 0.001<br />hp_learning_rate_critic: 0.005<br />hp_minlr_actor: 2e-06<br />hp_minlr_critic: 1e-05<br />hp_gamma: 0.99<br />hp_reg_term: 3<br />hp_batch_size: 64
|
|
|
|
| 24 |
|
| 25 |
# Actor-Critic Agent Playing PandaReachDense-v3
|
| 26 |
|
| 27 |
+
This is a trained model of an A2C agent playing PandaReachDense-v3.
|
| 28 |
|
| 29 |
# Hyperparameters
|
| 30 |
hp_seed: 2444<br />hp_torch_deterministic: True<br />hp_total_timesteps: 20500<br />hp_critic_nstep: 1<br />hp_num_envs: 12<br />hp_learning_rate_actor: 0.001<br />hp_learning_rate_critic: 0.005<br />hp_minlr_actor: 2e-06<br />hp_minlr_critic: 1e-05<br />hp_gamma: 0.99<br />hp_reg_term: 3<br />hp_batch_size: 64
|