Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
llmware
/
phi-3.5-mini-instruct-onnx-qnn
like
0
Follow
llmware
256
ONNX
phi3
green
llmware-chat
p3
qnn
emerald
custom_code
4-bit precision
gptq
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
phi-3.5-mini-instruct-onnx-qnn
1.97 GB
1 contributor
History:
4 commits
doberst
Update README.md
a56c919
verified
about 2 months ago
.gitattributes
1.52 kB
initial commit
about 2 months ago
README.md
1.04 kB
Update README.md
about 2 months ago
added_tokens.json
293 Bytes
Upload 20 files
about 2 months ago
chat_template.jinja
430 Bytes
Upload 20 files
about 2 months ago
config.json
3.79 kB
Upload 20 files
about 2 months ago
configuration_phi3.py
11.2 kB
Upload 20 files
about 2 months ago
context_0_ctx_qnn.bin
460 MB
xet
Upload 20 files
about 2 months ago
context_1_ctx_qnn.bin
460 MB
xet
Upload 20 files
about 2 months ago
context_2_ctx_qnn.bin
460 MB
xet
Upload 20 files
about 2 months ago
context_3_ctx_qnn.bin
460 MB
xet
Upload 20 files
about 2 months ago
context_ctx.onnx
1.63 MB
xet
Upload 20 files
about 2 months ago
embeddings.onnx
61.6 MB
xet
Upload 20 files
about 2 months ago
genai_config.json
17.8 kB
Upload 20 files
about 2 months ago
generation_config.json
172 Bytes
Upload 20 files
about 2 months ago
inference_model.json
420 Bytes
Upload 20 files
about 2 months ago
iterator_ctx.onnx
1.63 MB
xet
Upload 20 files
about 2 months ago
lm_head.onnx
61.6 MB
xet
Upload 20 files
about 2 months ago
modeling_phi3.py
73.8 kB
Upload 20 files
about 2 months ago
special_tokens_map.json
569 Bytes
Upload 20 files
about 2 months ago
tokenizer.json
3.62 MB
Upload 20 files
about 2 months ago
tokenizer.model
500 kB
xet
Upload 20 files
about 2 months ago
tokenizer_config.json
2.93 kB
Upload 20 files
about 2 months ago