Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
tttx
/
sft_r1_barc_pot_2k
like
0
Follow
tttx
8
PEFT
Safetensors
tttx/r1-trajectories-arcagi-barc
llama
alignment-handbook
trl
sft
Generated from Trainer
License:
llama3.1
Model card
Files
Files and versions
xet
Community
Use this model
main
sft_r1_barc_pot_2k
/
tokenizer.json
Commit History
Training in progress, epoch 1
8536ad0
verified
aadityap
commited on
Feb 7, 2025