jmkim89/OpenRS-GRPO
Text Generation
Transformers
Safetensors
knoveleng/open-rs
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
Commit History
End of training (73971ca, verified): jmkim89 committed on Apr 7, 2025
Model save (9360627, verified): jmkim89 committed on Apr 7, 2025
Training in progress, step 500 (2ad9c90, verified): jmkim89 committed on Apr 7, 2025
Training in progress, step 450 (df695e1, verified): jmkim89 committed on Apr 7, 2025
Training in progress, step 400 (7a40c5d, verified): jmkim89 committed on Apr 7, 2025
Training in progress, step 350 (3bd8fbf, verified): jmkim89 committed on Apr 6, 2025
Training in progress, step 300 (3dd5a39, verified): jmkim89 committed on Apr 6, 2025
Training in progress, step 250 (ffcb6ea, verified): jmkim89 committed on Apr 6, 2025
Training in progress, step 200 (405be57, verified): jmkim89 committed on Apr 6, 2025
Training in progress, step 150 (3796232, verified): jmkim89 committed on Apr 6, 2025
Training in progress, step 100 (4528758, verified): jmkim89 committed on Apr 6, 2025
Training in progress, step 50 (87b1fda, verified): jmkim89 committed on Apr 6, 2025
Initial commit (7779a21, verified): jmkim89 committed on Apr 6, 2025