llmware/phi-3-mini-4k-instruct-onnx-qnn
Tags: ONNX, phi3, green, llmware-chat, p3, qnn, emerald, custom_code, 4-bit precision, gptq
License: apache-2.0
Branch: main
Repository: phi-3-mini-4k-instruct-onnx-qnn
Total size: 1.97 GB
1 contributor
History: 4 commits
Latest commit: doberst, "Update README.md" (6fa72ce, verified), 4 months ago
File                      Size        Flags       Last commit        Updated
.gitattributes            1.52 kB     Safe        initial commit     4 months ago
README.md                 1.04 kB                 Update README.md   4 months ago
added_tokens.json         293 Bytes   Safe        Upload 21 files    4 months ago
chat_template.jinja       407 Bytes   Safe        Upload 21 files    4 months ago
config.json               1.48 kB                 Upload 21 files    4 months ago
configuration_phi3.py     11.2 kB     Safe        Upload 21 files    4 months ago
context_0_ctx_qnn.bin     460 MB      xet         Upload 21 files    4 months ago
context_1_ctx_qnn.bin     460 MB      xet         Upload 21 files    4 months ago
context_2_ctx_qnn.bin     460 MB      xet         Upload 21 files    4 months ago
context_3_ctx_qnn.bin     460 MB      xet         Upload 21 files    4 months ago
context_ctx.onnx          1.63 MB     xet         Upload 21 files    4 months ago
embeddings.onnx           61.6 MB     xet         Upload 21 files    4 months ago
genai_config.json         17.8 kB                 Upload 21 files    4 months ago
generation_config.json    172 Bytes   Safe        Upload 21 files    4 months ago
hash_record_sha256.json   1.14 kB                 Upload 21 files    4 months ago
inference_model.json      419 Bytes               Upload 21 files    4 months ago
iterator_ctx.onnx         1.63 MB     xet         Upload 21 files    4 months ago
lm_head.onnx              61.6 MB     xet         Upload 21 files    4 months ago
modeling_phi3.py          73.2 kB     Safe        Upload 21 files    4 months ago
special_tokens_map.json   561 Bytes   Safe        Upload 21 files    4 months ago
tokenizer.json            3.62 MB     Safe        Upload 21 files    4 months ago
tokenizer.model           500 kB      Safe, xet   Upload 21 files    4 months ago
tokenizer_config.json     2.95 kB                 Upload 21 files    4 months ago
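
As a quick sanity check, the per-file sizes in the listing can be summed and compared with the 1.97 GB total shown for the repository. The sketch below is only illustrative: the sizes are copied by hand from the listing, the names `LISTED_SIZES`, `UNIT_BYTES`, and `to_bytes` are hypothetical helpers, and it assumes the page uses decimal units (1 kB = 1000 bytes). Because the displayed sizes are rounded, the result is approximate.

```python
# Sizes copied from the file listing (as displayed, so already rounded).
LISTED_SIZES = {
    ".gitattributes": "1.52 kB",
    "README.md": "1.04 kB",
    "added_tokens.json": "293 Bytes",
    "chat_template.jinja": "407 Bytes",
    "config.json": "1.48 kB",
    "configuration_phi3.py": "11.2 kB",
    "context_0_ctx_qnn.bin": "460 MB",
    "context_1_ctx_qnn.bin": "460 MB",
    "context_2_ctx_qnn.bin": "460 MB",
    "context_3_ctx_qnn.bin": "460 MB",
    "context_ctx.onnx": "1.63 MB",
    "embeddings.onnx": "61.6 MB",
    "genai_config.json": "17.8 kB",
    "generation_config.json": "172 Bytes",
    "hash_record_sha256.json": "1.14 kB",
    "inference_model.json": "419 Bytes",
    "iterator_ctx.onnx": "1.63 MB",
    "lm_head.onnx": "61.6 MB",
    "modeling_phi3.py": "73.2 kB",
    "special_tokens_map.json": "561 Bytes",
    "tokenizer.json": "3.62 MB",
    "tokenizer.model": "500 kB",
    "tokenizer_config.json": "2.95 kB",
}

# Assumption: decimal (SI) units, i.e. 1 kB = 1000 bytes.
UNIT_BYTES = {"Bytes": 1, "kB": 1e3, "MB": 1e6, "GB": 1e9}


def to_bytes(size: str) -> float:
    """Convert a display string such as '61.6 MB' to a byte count."""
    value, unit = size.split()
    return float(value) * UNIT_BYTES[unit]


total_bytes = sum(to_bytes(s) for s in LISTED_SIZES.values())
total_gb = total_bytes / 1e9
print(f"{len(LISTED_SIZES)} files, about {total_gb:.2f} GB")
```

The four `context_*_ctx_qnn.bin` files account for 1840 MB of the total, so nearly all of the download is in those binaries.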