wzx111/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
Transformers
Safetensors
watermelonhjg/MATH-lighteval-level_2
13 languages
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
Commit History (branch: main)
Improve language tag (#1) | dbd3dbe | verified | wzx111, lbourdois | committed on Apr 28, 2025
Training in progress, epoch 0 | 1aea3a5 | verified | wzx111 | committed on Apr 17, 2025
Training in progress, epoch 1 | 573d1e1 | verified | wzx111 | committed on Apr 16, 2025
Training in progress, epoch 0 | feedc43 | verified | wzx111 | committed on Apr 16, 2025
End of training | c3f25c7 | verified | wzx111 | committed on Apr 15, 2025
Model save | a5527ba | verified | wzx111 | committed on Apr 15, 2025
Training in progress, epoch 3 | f38a46f | verified | wzx111 | committed on Apr 15, 2025
Training in progress, epoch 2 | b4fbf5d | verified | wzx111 | committed on Apr 15, 2025
Training in progress, epoch 1 | a1a45f0 | verified | wzx111 | committed on Apr 15, 2025
Training in progress, epoch 0 | e5aa979 | verified | wzx111 | committed on Apr 15, 2025
Update README.md | 80c45bb | verified | wzx111 | committed on Apr 15, 2025
End of training | 01f851e | verified | wzx111 | committed on Apr 15, 2025
Model save | 98b724d | verified | wzx111 | committed on Apr 15, 2025
Training in progress, epoch 3 | 7434824 | verified | wzx111 | committed on Apr 15, 2025
Training in progress, epoch 2 | 2fd65a1 | verified | wzx111 | committed on Apr 14, 2025
Training in progress, epoch 1 | 203faf7 | verified | wzx111 | committed on Apr 14, 2025
Training in progress, epoch 0 | d760317 | verified | wzx111 | committed on Apr 14, 2025
End of training | 7a376aa | verified | wzx111 | committed on Apr 9, 2025
Model save | e721dfa | verified | wzx111 | committed on Apr 9, 2025
Training in progress, epoch 1 | b101d88 | verified | wzx111 | committed on Apr 9, 2025
initial commit | 886e317 | verified | wzx111 | committed on Apr 1, 2025