Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
foamliu
/
Xmodel2-1.2B-Open-R1-GRPO
like
0
Text Generation
Transformers
TensorBoard
Safetensors
open-r1/OpenR1-Math-220k
minicpm
Generated from Trainer
open-r1
trl
grpo
conversational
custom_code
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
Deploy
Use this model
main
Xmodel2-1.2B-Open-R1-GRPO
Commit History
End of training
283e6d5
verified
foamliu
commited on
Mar 11, 2025
Model save
60ca924
verified
foamliu
commited on
Mar 11, 2025
initial commit
bb2ee27
verified
foamliu
commited on
Mar 5, 2025