Erland/DeepSeek-R1-0528-Qwen3-8B
Tags: Text Generation, Transformers, Safetensors, qwen3, conversational, text-generation-inference, arxiv:1910.09700
Branch: main, total size: 32.8 GB
1 contributor. History: 3 commits
Latest commit: Erland, "Upload tokenizer" (bd9d28b, verified), 11 months ago
File                              Size       Scan  Storage  Last commit              Updated
.gitattributes                    1.57 kB    Safe  -        Upload tokenizer         11 months ago
README.md                         5.17 kB    Safe  -        Upload Qwen3ForCausalLM  11 months ago
config.json                       910 Bytes  -     -        Upload Qwen3ForCausalLM  11 months ago
generation_config.json            171 Bytes  Safe  -        Upload Qwen3ForCausalLM  11 months ago
model-00001-of-00007.safetensors  4.97 GB    -     xet      Upload Qwen3ForCausalLM  11 months ago
model-00002-of-00007.safetensors  4.83 GB    -     xet      Upload Qwen3ForCausalLM  11 months ago
model-00003-of-00007.safetensors  4.83 GB    -     xet      Upload Qwen3ForCausalLM  11 months ago
model-00004-of-00007.safetensors  5 GB       -     xet      Upload Qwen3ForCausalLM  11 months ago
model-00005-of-00007.safetensors  4.83 GB    -     xet      Upload Qwen3ForCausalLM  11 months ago
model-00006-of-00007.safetensors  4.83 GB    -     xet      Upload Qwen3ForCausalLM  11 months ago
model-00007-of-00007.safetensors  3.46 GB    -     xet      Upload Qwen3ForCausalLM  11 months ago
model.safetensors.index.json      32.9 kB    Safe  -        Upload Qwen3ForCausalLM  11 months ago
special_tokens_map.json           472 Bytes  Safe  -        Upload tokenizer         11 months ago
tokenizer.json                    11.4 MB    Safe  xet      Upload tokenizer         11 months ago
tokenizer_config.json             9.49 kB    -     -        Upload tokenizer         11 months ago