Update README.md
Browse files
README.md
CHANGED
|
@@ -24,8 +24,8 @@ model-index:
|
|
| 24 |
# **Reinforce** Agent playing **CartPole-v1**
|
| 25 |
This is a trained model of a custom **Reinforce** agent playing **CartPole-v1**.
|
| 26 |
|
| 27 |
-
This was created for [Unit 4](https://huggingface.co/learn/deep-rl-course/unit4/introduction) of the Hugging Face Deep RL Course. I have added some features, such as entropy
|
| 28 |
-
loss, and updating the code for Gymnasium versus the deprecated Gym
|
| 29 |
|
| 30 |
Hyperparameters used to train were optimized by [Optuna](https://pypi.org/project/optuna/)
|
| 31 |
|
|
|
|
| 24 |
# **Reinforce** Agent playing **CartPole-v1**
|
| 25 |
This is a trained model of a custom **Reinforce** agent playing **CartPole-v1**.
|
| 26 |
|
| 27 |
+
This was created for [Unit 4](https://huggingface.co/learn/deep-rl-course/unit4/introduction) of the Hugging Face Deep RL Course. I have added some features, such as [entropy
|
| 28 |
+
loss](https://www.mathworks.com/help/reinforcement-learning/ug/reinforce-policy-gradient-agents.html), and updating the code for Gymnasium versus the deprecated Gym
|
| 29 |
|
| 30 |
Hyperparameters used to train were optimized by [Optuna](https://pypi.org/project/optuna/)
|
| 31 |
|