chansung/Qwen2.5-1.5B-Open-R1-Code-GRPO
Text Generation
Transformers
Safetensors
chansung/verifiable-coding-problems
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
Qwen2.5-1.5B-Open-R1-Code-GRPO/training_args.bin (main)
Commit History
999eaf6 · Training in progress, step 50 · chansung (verified) · committed on Nov 4, 2025
d52e8b7 · Training in progress, step 200 · chansung (verified) · committed on Nov 3, 2025
f53fb7e · Training in progress, step 50 · chansung (verified) · committed on Nov 3, 2025
7ab0c2d · Training in progress, step 50 · chansung (verified) · committed on Mar 30, 2025
0e33cbd · Training in progress, step 50 · chansung (verified) · committed on Mar 29, 2025