Qwen/Qwen3-4B-FP8
Text Generation
Transformers
Safetensors
qwen3
conversational
text-generation-inference
fp8
arxiv: 2309.00071
arxiv: 2505.09388
License: apache-2.0
Commit History (branch: main)
Create LICENSE
96b30dc (verified) littlebird13 committed on Jul 26, 2025

Update README.md
5dad03d (verified) littlebird13 committed on May 21, 2025

update tokenizer_config.json
f3ecd40 feihu.hf committed on May 19, 2025

Remove vLLM FP8 Limitation (#2)
d968998 (verified) jklj077, simon-mo committed on Apr 30, 2025

Update README.md
0dcffbe (verified) yangapku committed on Apr 29, 2025

Update README.md
884ae87 (verified) yangapku committed on Apr 29, 2025

Update README.md
bcd75a3 (verified) yangapku committed on Apr 28, 2025

Update README.md
35fec96 (verified) littlebird13 committed on Apr 28, 2025

Update README.md
1ef33a9 (verified) jklj077 committed on Apr 28, 2025

Delete special_tokens_map.json
97f8501 (verified) littlebird13 committed on Apr 28, 2025

Delete added_tokens.json
be2fe05 (verified) littlebird13 committed on Apr 28, 2025

Update README.md
c1919f6 (verified) littlebird13 committed on Apr 28, 2025

Update generation_config.json
e66e5a4 (verified) littlebird13 committed on Apr 28, 2025

Update README.md
1d3f2ab (verified) littlebird13 committed on Apr 28, 2025

Upload folder using huggingface_hub
ae9c71f (verified) littlebird13 committed on Apr 28, 2025

initial commit
3fdd654 (verified) littlebird13 committed on Apr 28, 2025