Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Blancy
/
Qwen3-1.7B-Open-R1-Code-GRPO
like
0
Text Generation
Transformers
Safetensors
Blancy/verifiable-coding-problems-CoT
qwen3
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Qwen3-1.7B-Open-R1-Code-GRPO
/
config.json
Commit History
Training in progress, step 20
5e5eda5
verified
Blancy
commited on
Jul 21, 2025
Training in progress, epoch 1
53d561a
verified
Blancy
commited on
Jul 19, 2025
Training in progress, epoch 1
196ef1b
verified
Blancy
commited on
Jul 18, 2025
Training in progress, epoch 1
ec274d1
verified
Blancy
commited on
Jul 18, 2025
Training in progress, step 10
ad1b427
verified
Blancy
commited on
Jul 14, 2025
Training in progress, step 10
d1cb81c
verified
Blancy
commited on
Jul 7, 2025
Training in progress, step 10
779c6c4
verified
Blancy
commited on
Jul 4, 2025
Training in progress, epoch 1
f24f451
verified
Blancy
commited on
Jul 1, 2025
Training in progress, epoch 1
114efb0
verified
Blancy
commited on
Jun 28, 2025
End of training
abfc92f
verified
Blancy
commited on
Jun 26, 2025
Training in progress, epoch 1
c50d066
verified
Blancy
commited on
Jun 26, 2025
Training in progress, epoch 1
287d796
verified
Blancy
commited on
Jun 25, 2025
End of training
21ce5f5
verified
Blancy
commited on
Jun 25, 2025
Training in progress, epoch 1
4706082
verified
Blancy
commited on
Jun 25, 2025
Training in progress, epoch 1
58dbcc9
verified
Blancy
commited on
Jun 21, 2025
Training in progress, epoch 1
4c76218
verified
Blancy
commited on
Jun 18, 2025