deep-rl-course / results.json

Commit History

train ppo model with 1,000,000 time steps
6f66be1

sigma-bit-dot committed on