PEFT
Safetensors
llama
alignment-handbook
trl
sft
Generated from Trainer
sft_r1_barc_pot_10k / trainer_state.json

Commit History

Model save
eabb649

aadityap committed on