Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
thejaminator
/
grpo-feature-vector-step-1
like
0
PEFT
Safetensors
English
verl
grpo
math
reasoning
rl
lora
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Use this model
main
grpo-feature-vector-step-1
Commit History
verl GRPO trained model at step 1
ade0533
verified
thejaminator
commited on
Aug 27, 2025
verl GRPO trained model at step 1
1c54e58
verified
thejaminator
commited on
Aug 15, 2025
initial commit
5b05b05
verified
thejaminator
commited on
Aug 15, 2025