rvs/llama3-8b-Instruct-kvc-AWQ-int4-onnx-split
Tags: ONNX, text-generation-inference, llama, llama3
Branch: main (6.79 GB)
1 contributor, History: 2 commits
Latest commit: rvs, "Upload folder using huggingface_hub" (76527e7, verified), 8 months ago
File                      Size        Badge   Last commit                           Updated
.gitattributes            1.72 kB     -       Upload folder using huggingface_hub   8 months ago
README.md                 6.94 kB     -       Upload folder using huggingface_hub   8 months ago
combined_weights.data     5.73 GB     xet     Upload folder using huggingface_hub   8 months ago
config.json               0 Bytes     Safe    Upload folder using huggingface_hub   8 months ago
entrypoint.py             26.9 kB     -       Upload folder using huggingface_hub   8 months ago
model.onnx                2.26 MB     xet     Upload folder using huggingface_hub   8 months ago
onnx__MatMul_10363        1.05 GB     xet     Upload folder using huggingface_hub   8 months ago
special_tokens_map.json   301 Bytes   Safe    Upload folder using huggingface_hub   8 months ago
token_id_to_str.json      2.8 MB      Safe    Upload folder using huggingface_hub   8 months ago
tokenizer.json            9.09 MB     Safe    Upload folder using huggingface_hub   8 months ago
tokenizer_config.json     51 kB       Safe    Upload folder using huggingface_hub   8 months ago