Procedure for SFT and TRL
#1
by pramodjella - opened
Could you please share the instruction format you followed, the datasets used, the reward model, and any other relevant details?