Procedure for SFT and TRL

#1
by pramodjella - opened

Could you please share the instruction format you followed, the datasets used, the reward model, and any other training details?
