PEFT
Safetensors
qwen2
alignment-handbook
trl
sft
Generated from Trainer
sft_r1_7b / tokenizer.json

Commit History

Training in progress, epoch 1
a9b3e2e
verified

aadityap commited on

Model save
215344e
verified

aadityap commited on