Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
webAI-Official
/
math-finetuned-grpo-adapter
like
0
Follow
webAI
28
Text Generation
PEFT
Safetensors
Transformers
grpo
lora
trl
conversational
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Community
Use this model
main
math-finetuned-grpo-adapter
/
README.md
Commit History
Update README.md
c8a009f
verified
abdul-hannan
commited on
27 days ago
Upload 10 files
f40d5a5
verified
abdul-hannan
commited on
27 days ago
initial commit
511612c
verified
abdul-hannan
commited on
27 days ago