HerrHruby/vanilla_grpo_acemath_rl_4b_inst_32k_step_180
Safetensors
qwen3
main
Commit History
Upload trained model
299d1e3
verified
HerrHruby
committed on
Nov 27, 2025
initial commit
8a373f9
verified
HerrHruby
committed on
Nov 27, 2025