Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
thejaminator
/
12sep_grp16_1e5_lr-step-60
like
0
Text Generation
PEFT
Safetensors
lora
Model card
Files
Files and versions
xet
Community
Use this model
main
12sep_grp16_1e5_lr-step-60
Commit History
verl GRPO trained model at step 60
f564542
verified
thejaminator
commited on
Sep 12, 2025
initial commit
2a6162c
verified
thejaminator
commited on
Sep 12, 2025