Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
od2961
/
Qwen2.5-7B-Open-R1-GRPO-math-7b
like
0
Text Generation
Transformers
Safetensors
qwen2
Generated from Trainer
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Qwen2.5-7B-Open-R1-GRPO-math-7b
Commit History
Training in progress, step 550
253aa0e
verified
od2961
commited on
Oct 3, 2025
Training in progress, step 500
6de28d7
verified
od2961
commited on
Sep 14, 2025
Training in progress, step 450
63b5d6a
verified
od2961
commited on
Sep 13, 2025
Training in progress, step 400
b858f02
verified
od2961
commited on
Sep 12, 2025
Training in progress, step 350
68c9a0b
verified
od2961
commited on
Sep 11, 2025
Training in progress, step 300
bfb2363
verified
od2961
commited on
Sep 10, 2025
Training in progress, step 250
ef69a0b
verified
od2961
commited on
Sep 8, 2025
Training in progress, step 200
527a910
verified
od2961
commited on
Sep 7, 2025
Training in progress, step 150
383d376
verified
od2961
commited on
Sep 6, 2025
Training in progress, step 100
b112760
verified
od2961
commited on
Sep 5, 2025
Training in progress, step 50
3cf9a15
verified
od2961
commited on
Sep 5, 2025
initial commit
eb9d70b
verified
od2961
commited on
Sep 3, 2025