wlzhou/Qwen2.5-3B-Open-R1-GRPO
Text Generation
Transformers
Safetensors
qwen2
Generated from Trainer
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
main
Qwen2.5-3B-Open-R1-GRPO/training_args.bin
Commit History
Training in progress, step 50 (8f1dddf, verified): wlzhou committed on Mar 13, 2025
Training in progress, step 50 (6f19bb7, verified): wlzhou committed on Mar 12, 2025
Training in progress, step 50 (384b2bb, verified): wlzhou committed on Mar 11, 2025
Training in progress, step 50 (0495fe7, verified): wlzhou committed on Mar 10, 2025
Training in progress, step 50 (ca514e9, verified): wlzhou committed on Mar 10, 2025
Training in progress, step 50 (eaa0195, verified): wlzhou committed on Mar 9, 2025
Training in progress, step 50 (49dc355, verified): wlzhou committed on Mar 9, 2025
Training in progress, step 50 (61d349b, verified): wlzhou committed on Mar 8, 2025
Training in progress, step 50 (520ea9c, verified): wlzhou committed on Mar 8, 2025
Training in progress, step 200 (3f6f542, verified): wlzhou committed on Mar 7, 2025
Training in progress, step 50 (257a38a, verified): wlzhou committed on Mar 6, 2025
Training in progress, step 50 (ad19cfa, verified): wlzhou committed on Mar 6, 2025
Training in progress, step 50 (792b392, verified): wlzhou committed on Mar 6, 2025
Training in progress, step 50 (d7497f0, verified): wlzhou committed on Mar 6, 2025
Training in progress, step 50 (207d61b, verified): wlzhou committed on Mar 5, 2025