Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
w601sxs
/
b1ade_0.5B
like
0
Reinforcement Learning
Safetensors
w601sxs/simplecot_subset_50k
English
qwen2
grpo
rag
qwen
b1ade
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
b1ade_0.5B
1,000 MB
Ctrl+K
Ctrl+K
1 contributor
History:
4 commits
w601sxs
Add model card
64e14bc
verified
17 days ago
.gitattributes
Safe
1.57 kB
Upload tokenizer
20 days ago
README.md
Safe
3.02 kB
Add model card
17 days ago
chat_template.jinja
Safe
2.51 kB
Upload tokenizer
20 days ago
config.json
Safe
1.28 kB
Upload Qwen2ForCausalLM
20 days ago
generation_config.json
Safe
215 Bytes
Upload Qwen2ForCausalLM
20 days ago
model.safetensors
988 MB
xet
Upload Qwen2ForCausalLM
20 days ago
tokenizer.json
Safe
11.4 MB
xet
Upload tokenizer
20 days ago
tokenizer_config.json
Safe
665 Bytes
Upload tokenizer
20 days ago