deep-rl-course / README.md

Commit History

train ppo model with 1,000,000 time steps
6f66be1

sigma-bit-dot commited on