Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
HerrHruby
/
vanilla_grpo_acemath_rl_4b_inst_16k_step_180
like
0
Safetensors
qwen3
Model card
Files
Files and versions
xet
Community
main
vanilla_grpo_acemath_rl_4b_inst_16k_step_180
8.06 GB
1 contributor
History:
2 commits
HerrHruby
Upload trained model
c0481ba
verified
13 days ago
.gitattributes
Safe
1.57 kB
Upload trained model
13 days ago
added_tokens.json
Safe
707 Bytes
Upload trained model
13 days ago
chat_template.jinja
Safe
2.63 kB
Upload trained model
13 days ago
config.json
Safe
728 Bytes
Upload trained model
13 days ago
generation_config.json
Safe
121 Bytes
Upload trained model
13 days ago
merges.txt
Safe
1.67 MB
Upload trained model
13 days ago
model.safetensors
8.04 GB
xet
Upload trained model
13 days ago
special_tokens_map.json
Safe
613 Bytes
Upload trained model
13 days ago
tokenizer.json
Safe
11.4 MB
xet
Upload trained model
13 days ago
tokenizer_config.json
Safe
5.41 kB
Upload trained model
13 days ago
vocab.json
Safe
2.78 MB
Upload trained model
13 days ago