reeeemo
/

Reinforce-CartPole-v1

Reinforcement Learning

custom-implementation

Eval Results (legacy)

Model card Files Files and versions

reeeemo commited on Dec 29, 2025

Commit

a48f656

·

verified ·

1 Parent(s): 848765a

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -24,8 +24,8 @@ model-index:
 # **Reinforce** Agent playing **CartPole-v1**
 This is a trained model of a custom **Reinforce** agent playing **CartPole-v1**.
-This was created for [Unit 4](https://huggingface.co/learn/deep-rl-course/unit4/introduction) of the Hugging Face Deep RL Course. I have added some features, such as entropy
-loss, and updating the code for Gymnasium versus the deprecated Gym
 Hyperparameters used to train were optimized by [Optuna](https://pypi.org/project/optuna/)

 # **Reinforce** Agent playing **CartPole-v1**
 This is a trained model of a custom **Reinforce** agent playing **CartPole-v1**.
+This was created for [Unit 4](https://huggingface.co/learn/deep-rl-course/unit4/introduction) of the Hugging Face Deep RL Course. I have added some features, such as [entropy
+loss](https://www.mathworks.com/help/reinforcement-learning/ug/reinforce-policy-gradient-agents.html), and updating the code for Gymnasium versus the deprecated Gym
 Hyperparameters used to train were optimized by [Optuna](https://pypi.org/project/optuna/)