RewardModel / rng_state_0.pth

Commit History

First model version
289bb98

Xingyu Fu commited on