Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
tttx
/
sft_r1_barc_pot_10k
like
0
Follow
tttx
8
PEFT
Safetensors
tttx/r1-trajectories-collection-round-2
tttx/r1-trajectories-arcagi-barc
llama
alignment-handbook
trl
sft
Generated from Trainer
License:
llama3.1
Model card
Files
Files and versions
xet
Community
Use this model
main
sft_r1_barc_pot_10k
Commit History
End of training
d2bc978
verified
aadityap
commited on
Feb 7, 2025
Model save
eabb649
verified
aadityap
commited on
Feb 7, 2025
Training in progress, epoch 3
6c93b53
verified
aadityap
commited on
Feb 7, 2025
Training in progress, epoch 2
c0b685e
verified
aadityap
commited on
Feb 7, 2025
Training in progress, epoch 1
df3b3a6
verified
aadityap
commited on
Feb 7, 2025
initial commit
74c1d18
verified
aadityap
commited on
Feb 7, 2025