Procedure for SFT and TRL
#1 opened by pramodjella
Could you please share the instruction format you followed, the datasets used, the reward model, and any other relevant information?