Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
bigatuna
/
Qwen3-0.6B-Sushi-Coder
like
1
Text Generation
Transformers
Safetensors
microsoft/rStar-Coder
open-r1/codeforces-cots
English
qwen3
code
sft
grpo
trl
conversational
text-generation-inference
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Qwen3-0.6B-Sushi-Coder
1.21 GB
1 contributor
History:
22 commits
bigatuna
Upload README.md with huggingface_hub
d5e6e54
verified
20 days ago
.gitattributes
1.63 kB
Upload Qwen3-0.6B-Sushi-Coder.png with huggingface_hub
20 days ago
Qwen3-0.6B-Sushi-Coder.png
1.68 MB
xet
Upload Qwen3-0.6B-Sushi-Coder.png with huggingface_hub
20 days ago
README.md
2.93 kB
Upload README.md with huggingface_hub
20 days ago
added_tokens.json
707 Bytes
Training in progress, step 100
20 days ago
chat_template.jinja
4.17 kB
Training in progress, step 100
20 days ago
config.json
1.36 kB
Upload Qwen3ForCausalLM
20 days ago
generation_config.json
188 Bytes
Upload Qwen3ForCausalLM
20 days ago
merges.txt
1.67 MB
Training in progress, step 100
20 days ago
model.safetensors
1.19 GB
xet
Upload Qwen3ForCausalLM
20 days ago
special_tokens_map.json
613 Bytes
Training in progress, step 100
20 days ago
tokenizer.json
11.4 MB
xet
Upload tokenizer
20 days ago
tokenizer_config.json
5.54 kB
Training in progress, step 100
20 days ago
vocab.json
2.78 MB
Training in progress, step 100
20 days ago