RewardModel / trainer_state.json

Commit History

First model version
289bb98

Xingyu Fu commited on