Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
HerrHruby
/
vanilla_grpo_acemath_rl_4b_inst_16k_step_180
like
0
Safetensors
qwen3
Model card
Files
Files and versions
xet
Community
main
vanilla_grpo_acemath_rl_4b_inst_16k_step_180
Commit History
Upload trained model
c0481ba
verified
HerrHruby
commited on
Nov 29, 2025
initial commit
1f5c2ab
verified
HerrHruby
commited on
Nov 29, 2025