Update weights to v2 open-world RL step-100 checkpoint (79.31% held-out hypothesis accuracy)
Commit 5e3f92b (verified), committed by Jarrodbarnes 3 days ago