Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Yukang
/
Qwen2.5-3B-Open-R1-Code-GRPO
like
0
Text Generation
Transformers
Safetensors
open-r1/verifiable-coding-problems-python
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Qwen2.5-3B-Open-R1-Code-GRPO
Commit History
Training in progress, step 250
6eeb77b
verified
Yukang
commited on
Jun 14, 2025
Training in progress, step 250
da451ff
verified
Yukang
commited on
Jun 14, 2025
Training in progress, step 200
768c079
verified
Yukang
commited on
Jun 14, 2025
Training in progress, step 200
1ce461c
verified
Yukang
commited on
Jun 14, 2025
Training in progress, step 200
2816fe6
verified
Yukang
commited on
Jun 14, 2025
Training in progress, step 200
4244897
verified
Yukang
commited on
Jun 14, 2025
Training in progress, step 150
1918044
verified
Yukang
commited on
Jun 13, 2025
Training in progress, step 150
d1899d0
verified
Yukang
commited on
Jun 13, 2025
Training in progress, step 150
a0b09ac
verified
Yukang
commited on
Jun 13, 2025
Training in progress, step 150
dfa9d33
verified
Yukang
commited on
Jun 13, 2025
Training in progress, step 100
e4b1b42
verified
Yukang
commited on
Jun 13, 2025
Training in progress, step 100
dd4f78d
verified
Yukang
commited on
Jun 13, 2025
Training in progress, step 100
6bc6d04
verified
Yukang
commited on
Jun 13, 2025
Training in progress, step 100
9670292
verified
Yukang
commited on
Jun 13, 2025
Training in progress, step 50
a2b4013
verified
Yukang
commited on
Jun 13, 2025
Training in progress, step 50
a4b89a6
verified
Yukang
commited on
Jun 13, 2025
Training in progress, step 50
bc35cfd
verified
Yukang
commited on
Jun 13, 2025
Training in progress, step 50
3b14c19
verified
Yukang
commited on
Jun 13, 2025
initial commit
87da196
verified
Yukang
commited on
Jun 13, 2025
Previous
1
2
3
4
Next