Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
thejaminator
/
5e6_lr_14sep_bigger_batch_step_187
like
0
Text Generation
PEFT
Safetensors
lora
Model card
Files
Files and versions
xet
Community
Use this model
main
5e6_lr_14sep_bigger_batch_step_187
Commit History
verl GRPO trained model at step 187
84373ad
verified
thejaminator
commited on
Sep 15, 2025
initial commit
493d3e5
verified
thejaminator
commited on
Sep 15, 2025