Rl-Unit1-CoLab / results.json

Commit History

First try of Unit 1 CoLab with PPO trained with 1 million steps
33ba93c
verified

alanwsx commited on