Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
mlxha
/
Qwen-2.5-3B-grpo-code
like
0
Text Generation
Transformers
Safetensors
open-r1/verifiable-coding-problems-python
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
main
Qwen-2.5-3B-grpo-code
Commit History
End of training
30a486a
verified
mlxha
commited on
Apr 18, 2025
Model save
d055e09
verified
mlxha
commited on
Apr 18, 2025
Training in progress, step 559
badded7
verified
mlxha
commited on
Apr 18, 2025
Training in progress, step 550
7b253e9
verified
mlxha
commited on
Apr 18, 2025
Training in progress, step 525
9ea9cd0
verified
mlxha
commited on
Apr 18, 2025
Training in progress, step 500
d702b7f
verified
mlxha
commited on
Apr 17, 2025
Training in progress, step 475
d416134
verified
mlxha
commited on
Apr 17, 2025
Training in progress, step 450
27af7ed
verified
mlxha
commited on
Apr 17, 2025
Training in progress, step 425
5e7b0fb
verified
mlxha
commited on
Apr 17, 2025
Training in progress, step 400
5c0faca
verified
mlxha
commited on
Apr 17, 2025
Training in progress, step 375
fb67f20
verified
mlxha
commited on
Apr 17, 2025
Training in progress, step 350
d1e198a
verified
mlxha
commited on
Apr 17, 2025
Training in progress, step 325
34dfd58
verified
mlxha
commited on
Apr 17, 2025
Training in progress, step 300
8a2d35f
verified
mlxha
commited on
Apr 17, 2025
Training in progress, step 275
7bca37c
verified
mlxha
commited on
Apr 17, 2025
Training in progress, step 250
7eba746
verified
mlxha
commited on
Apr 17, 2025
Training in progress, step 225
f3b5551
verified
mlxha
commited on
Apr 16, 2025
Training in progress, step 200
c3c48cf
verified
mlxha
commited on
Apr 16, 2025
Training in progress, step 175
a4f84a4
verified
mlxha
commited on
Apr 16, 2025
Training in progress, step 150
b1e411e
verified
mlxha
commited on
Apr 16, 2025
Training in progress, step 125
f28d34b
verified
mlxha
commited on
Apr 16, 2025
Training in progress, step 100
d746bb7
verified
mlxha
commited on
Apr 16, 2025
Training in progress, step 75
5d8ff35
verified
mlxha
commited on
Apr 16, 2025
Training in progress, step 50
6f420ef
verified
mlxha
commited on
Apr 16, 2025
Training in progress, step 25
1a12ca7
verified
mlxha
commited on
Apr 16, 2025
initial commit
3602bec
verified
mlxha
commited on
Apr 15, 2025