leosaros/14bgrpo
Tags: PEFT, Safetensors, Transformers, AutoModel, grpo, lora, trl, unsloth, custom_code, arxiv:1910.09700
Commit History
b648349: Add tarball (3v324v23, committed on Nov 26, 2025)
166d114: Add minimal config.json (3v324v23, committed on Nov 26, 2025)
e728436: Update base_model_name to Qwen2.5-14B-Instruct (3v324v23, committed on Nov 26, 2025)
2f8e6b9: Upload README.md (leosaros, verified, committed on Nov 26, 2025)
11c9ad9: Upload 10 files (leosaros, verified, committed on Nov 26, 2025)
6767877: initial commit (leosaros, verified, committed on Nov 26, 2025)