Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Thomas-Chou
/
Qwen2.5-1.5B-Open-R1-GRPO
like
0
Text Generation
Transformers
Safetensors
open-r1/OpenR1-Math-220k
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
main
Qwen2.5-1.5B-Open-R1-GRPO
Commit History
End of training
e89fd66
verified
Thomas-Chou
commited on
Jan 9
Model save
e76c580
verified
Thomas-Chou
commited on
Jan 9
Training in progress, step 5600
eefe861
verified
Thomas-Chou
commited on
Jan 8
Training in progress, step 4800
f0d1165
verified
Thomas-Chou
commited on
Jan 8
Training in progress, step 3200
b3f07c4
verified
Thomas-Chou
commited on
Jan 7
Training in progress, step 2400
287f65c
verified
Thomas-Chou
commited on
Jan 4
Training in progress, step 1600
588ce34
verified
Thomas-Chou
commited on
Jan 4
Training in progress, step 800
5635361
verified
Thomas-Chou
commited on
Jan 4
End of training
9fc151c
verified
Thomas-Chou
commited on
Jan 4
Model save
15191f6
verified
Thomas-Chou
commited on
Jan 4
Training in progress, step 2400
1cb3f62
verified
Thomas-Chou
commited on
Jan 3
Training in progress, step 1600
909a33e
verified
Thomas-Chou
commited on
Jan 3
Training in progress, step 800
1c7983a
verified
Thomas-Chou
commited on
Jan 3
End of training
71ef1a3
verified
Thomas-Chou
commited on
Feb 10, 2025
Model save
86cdb62
verified
Thomas-Chou
commited on
Feb 10, 2025
initial commit
d9bb8f5
verified
Thomas-Chou
commited on
Feb 10, 2025