FastFlowLM/Llama-3.1-8B-NPU2
Tags: Text Generation, English, llama, llama-3.1, AMD, Ryzen, NPU, conversational
License: llama3
Branch: main
2 contributors; history: 14 commits
Latest commit: Update README.md (5ac86d4, verified) by FastFlowLM, 9 months ago
File                   Scan  Size       Storage  Last commit           Updated
.gitattributes         Safe  343 Bytes  -        support_prefill       11 months ago
README.md              Safe  2.21 kB    -        Update README.md      9 months ago
attn.xclbin            Safe  698 kB     xet      support_prefill       11 months ago
config.json            Safe  1.02 kB    -        Update config.json    10 months ago
dequant.xclbin         Safe  116 kB     xet      update_dequant        10 months ago
layer.xclbin           -     332 kB     xet      update_glu            10 months ago
lm_head.xclbin         -     153 kB     xet      support_prefill       11 months ago
mm.xclbin              Safe  348 kB     xet      new_mm                10 months ago
model.q4nx             Safe  5.74 GB    xet      add_model             about 1 year ago
tokenizer.json         Safe  17.2 MB    xet      add_model             about 1 year ago
tokenizer_config.json  Safe  4.33 kB    -        Add tokenizer_config  10 months ago