Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
tttx
/
sft_r1_7b
like
0
Follow
tttx
8
PEFT
Safetensors
tttx/r1-trajectories-collection-round-2
tttx/r1-trajectories-arcagi-barc
qwen2
alignment-handbook
trl
sft
Generated from Trainer
License:
mit
Model card
Files
Files and versions
xet
Community
Use this model
main
sft_r1_7b
/
tokenizer.json
Commit History
Training in progress, epoch 1
a9b3e2e
verified
aadityap
commited on
Feb 4, 2025
Model save
215344e
verified
aadityap
commited on
Feb 3, 2025