Rl-Unit1-CoLab / replay.mp4

Commit History

First try of Unit 1 CoLab with PPO trained with 1 million steps
33ba93c

alanwsx committed on