Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
krisezra87
/
Qwen2.5-1.5B-Open-R1-Code-GRPO
like
0
Text Generation
Transformers
Safetensors
open-r1/verifiable-coding-problems-python
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Qwen2.5-1.5B-Open-R1-Code-GRPO
Commit History
End of training
328eb6e
verified
krisezra87
commited on
May 3, 2025
Model save
2b426b1
verified
krisezra87
commited on
May 3, 2025
Training in progress, step 502
d5d80c2
verified
krisezra87
commited on
May 3, 2025
End of training
60740a6
verified
krisezra87
commited on
May 3, 2025
Model save
bed4485
verified
krisezra87
commited on
May 3, 2025
Training in progress, step 501
7c54f42
verified
krisezra87
commited on
May 3, 2025
Training in progress, step 450
15347dc
verified
krisezra87
commited on
May 3, 2025
Training in progress, step 400
9627d23
verified
krisezra87
commited on
May 3, 2025
Training in progress, step 350
7fe0da2
verified
krisezra87
commited on
May 3, 2025
Training in progress, step 300
aef2a4d
verified
krisezra87
commited on
May 2, 2025
Training in progress, step 250
3f6a1fd
verified
krisezra87
commited on
May 2, 2025
Training in progress, step 200
072bda2
verified
krisezra87
commited on
May 2, 2025
Training in progress, step 150
75797e5
verified
krisezra87
commited on
May 2, 2025
Training in progress, step 100
6c29afc
verified
krisezra87
commited on
May 2, 2025
Training in progress, step 50
c023668
verified
krisezra87
commited on
May 2, 2025
initial commit
b856036
verified
krisezra87
commited on
May 1, 2025