sft_rewardmodel_final / training_args.bin

Commit History

Upload reward model
12bf163
verified

MMattaparthy commited on