joshkaura/ppo-CartPole-v1
Tags: Reinforcement Learning, PyTorch, TensorBoard, CartPole-v1, ppo, cleanrl, deep-rl-class, custom-implementation, Eval Results (legacy)
PPO Agent Playing CartPole-v1
Trained with a minimal CleanRL-style PPO implementation in Google Colab.
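The training code itself is not included in this card. As a rough illustration of what a CleanRL-style PPO update optimizes, the clipped policy loss can be sketched as below. This is a plain-Python sketch, not the author's implementation: the function name `ppo_clipped_loss` is hypothetical, and the default `clip_coef=0.2` is an assumption (a commonly used value), not a setting reported for this model.

```python
import math


def ppo_clipped_loss(new_logprobs, old_logprobs, advantages, clip_coef=0.2):
    """Mean PPO clipped policy loss over a batch (a quantity to minimize).

    new_logprobs: log-probabilities of the taken actions under the current policy
    old_logprobs: log-probabilities of the same actions under the rollout policy
    advantages:   advantage estimates for those actions
    clip_coef:    clipping range (assumed value, not taken from this model card)
    """
    losses = []
    for new_lp, old_lp, adv in zip(new_logprobs, old_logprobs, advantages):
        # Probability ratio between the current and the rollout policy.
        ratio = math.exp(new_lp - old_lp)
        # Keep the ratio inside [1 - clip_coef, 1 + clip_coef].
        clipped_ratio = max(1.0 - clip_coef, min(ratio, 1.0 + clip_coef))
        # Pessimistic choice: the larger of the two negated objectives.
        losses.append(max(-adv * ratio, -adv * clipped_ratio))
    return sum(losses) / len(losses)
```

In a PyTorch implementation the same expression would normally be written with tensor operations so that gradients flow through `new_logprobs`; the scalar version here only shows the arithmetic.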
Results
Mean reward: 83.60
Std reward: 50.09
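The card does not say how these two numbers were computed. A common approach is to run the trained agent for a number of evaluation episodes and report the mean and standard deviation of the per-episode returns. The sketch below assumes that approach and assumes a population standard deviation; the function name `summarize_returns` and the list of returns are hypothetical and are not this model's evaluation data.

```python
import statistics


def summarize_returns(episode_returns):
    """Return (mean, population std) of a list of per-episode returns."""
    mean_reward = statistics.fmean(episode_returns)
    # Population standard deviation (divides by N); this choice is an assumption.
    std_reward = statistics.pstdev(episode_returns)
    return mean_reward, std_reward


# Hypothetical episode returns, for illustration only.
returns = [40.0, 60.0, 80.0, 100.0, 120.0]
mean_reward, std_reward = summarize_returns(returns)
print(f"{mean_reward:.2f} +/- {std_reward:.2f}")
```

A standard deviation that is large relative to the mean, as reported above, indicates that episode returns varied widely between evaluation runs.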
Evaluation results
mean_reward on CartPole-v1 (self-reported): 83.60 +/- 50.09