Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
SamMikaelson
/
grpo-checkpoints-better
like
0
Text Generation
PEFT
Safetensors
Transformers
grpo
lora
sft
trl
unsloth
conversational
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Community
Use this model
main
grpo-checkpoints-better
Commit History
Upload folder using huggingface_hub
6edc835
verified
SamMikaelson
commited on
Dec 7, 2025
Upload folder using huggingface_hub
4d46fad
verified
SamMikaelson
commited on
Dec 7, 2025
Upload folder using huggingface_hub
93a7e0f
verified
SamMikaelson
commited on
Dec 7, 2025
Upload folder using huggingface_hub
9b0d518
verified
SamMikaelson
commited on
Dec 7, 2025
Upload folder using huggingface_hub
895d284
verified
SamMikaelson
commited on
Dec 7, 2025
Upload folder using huggingface_hub
d91eee2
verified
SamMikaelson
commited on
Dec 7, 2025
Upload folder using huggingface_hub
4081a2b
verified
SamMikaelson
commited on
Dec 6, 2025
Upload folder using huggingface_hub
e649261
verified
SamMikaelson
commited on
Dec 4, 2025
Upload folder using huggingface_hub
529c2cd
verified
SamMikaelson
commited on
Dec 4, 2025
Upload folder using huggingface_hub
ef895e2
verified
SamMikaelson
commited on
Dec 4, 2025
Upload folder using huggingface_hub
049e119
verified
SamMikaelson
commited on
Dec 4, 2025
Upload folder using huggingface_hub
10070a0
verified
SamMikaelson
commited on
Dec 4, 2025
initial commit
e32edb8
verified
SamMikaelson
commited on
Dec 4, 2025