PEFT
Safetensors
llama
alignment-handbook
trl
sft
Generated from Trainer
sft_r1_barc_pot_10k / tokenizer.json

Commit History

Training in progress, epoch 1
df3b3a6
verified

aadityap commited on