Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
leosaros
/
14bgrpo
like
0
PEFT
Safetensors
Transformers
AutoModel
grpo
lora
trl
unsloth
custom_code
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Community
Use this model
main
14bgrpo
/
tokenizer.json
Commit History
Upload 10 files
11c9ad9
verified
leosaros
commited on
Nov 26, 2025