FastFlowLM/Qwen3-8B-NPU2
Tags: Text Generation, Transformers, English, qwen3, qwen, conversational
arXiv: 2309.00071
License: apache-2.0
Branch: main
Repository: Qwen3-8B-NPU2 (5.99 GB, 2 contributors, History: 10 commits)
Latest commit: 51ef993 by AlfredXu2, 6 months ago: Merge branch 'main' of fastflowlm.co:FastFlowLM/Qwen3-8B-NPU2
File                   Size       Storage  Last commit                        Updated
.gitattributes         343 Bytes  -        init                               6 months ago
README.md              16.2 kB    -        Update README.md                   6 months ago
attn.xclbin            698 kB     xet      init                               6 months ago
config.json            870 Bytes  -        update_version_config              6 months ago
dequant.xclbin         115 kB     xet      update_q4_1                        6 months ago
layer.xclbin           305 kB     xet      update_q4_1                        6 months ago
lm_head.xclbin         153 kB     xet      update_q4_1                        6 months ago
mm.xclbin              348 kB     xet      update_mm                          6 months ago
model.q4nx             5.98 GB    xet      update_q4_1                        6 months ago
tokenizer.json         11.4 MB    xet      init                               6 months ago
tokenizer_config.json  10.1 kB    -        Update tokenizer_config.json (#1)  6 months ago