RewardModel / rng_state_2.pth

Commit History

First model version
289bb98

Xingyu Fu commited on