PEFT
Safetensors
llama
alignment-handbook
trl
sft
Generated from Trainer
sft_r1_barc_pot_10k / config.json

Commit History

End of training
d2bc978
verified

aadityap commited on