Hugging Face
purbeshmitra/vanillaGRPO
Tags: Text Generation, Transformers, Safetensors
Datasets: openai/gsm8k, HuggingFaceH4/MATH-500, HuggingFaceH4/aime_2024
Language: English
arxiv: 2507.02851
License: apache-2.0
Revision: eb940ae (148 MB)
2 contributors. History: 2 commits
Latest commit: "Upload 3 files" by purbeshmitra (eb940ae, verified), 10 months ago
File                       Size       Scan  Storage  Last commit     Updated
.gitattributes             1.52 kB    Safe  -        initial commit  10 months ago
README.md                  5.12 kB    -     -        Upload 3 files  10 months ago
adapter_config.json        876 Bytes  Safe  -        Upload 3 files  10 months ago
adapter_model.safetensors  148 MB     Safe  xet      Upload 3 files  10 months ago