Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
purbeshmitra
/
vanillaGRPO
like
0
Text Generation
Transformers
Safetensors
openai/gsm8k
HuggingFaceH4/MATH-500
HuggingFaceH4/aime_2024
English
arxiv:
2507.02851
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
main
vanillaGRPO
/
assets
692 kB
2 contributors
History:
3 commits
purbeshmitra
Rename multiround.png to assets/multiround.png
72a44b3
verified
7 months ago
motif_results.png
Safe
54.1 kB
Rename motif_results.png to assets/motif_results.png
7 months ago
multiround.png
Safe
252 kB
xet
Rename multiround.png to assets/multiround.png
7 months ago
multiround_grpo.png
Safe
386 kB
xet
Rename multiround_grpo.png to assets/multiround_grpo.png
7 months ago