ppo-CartPole-v1 / README.md
arnemaass's picture
PPO agent upload from Colab
45cf6f5 verified

PPO Agent for CartPole-v1

Trained from scratch using CleanRL-style PPO implementation.

Mean Reward: 103.40 ± 54.35