Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
clowman
/
Llama-3.2-3B-Instruct-GPTQ-Int8
like
0
Text Generation
Transformers
Safetensors
PyTorch
8 languages
llama
facebook
meta
llama-3
conversational
text-generation-inference
8-bit precision
gptq
arxiv:
2204.05149
arxiv:
2405.16406
License:
llama3.2
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Llama-3.2-3B-Instruct-GPTQ-Int8
3.69 GB
1 contributor
History:
3 commits
clowman
Update README.md
1dcb6e2
verified
11 months ago
.gitattributes
1.57 kB
Upload folder using huggingface_hub
11 months ago
README.md
42.2 kB
Update README.md
11 months ago
USE_POLICY.md
6.02 kB
Upload folder using huggingface_hub
11 months ago
args-lambda-quant.json
258 Bytes
Upload folder using huggingface_hub
11 months ago
config.json
1.52 kB
Upload folder using huggingface_hub
11 months ago
generation_config.json
184 Bytes
Upload folder using huggingface_hub
11 months ago
model.safetensors
3.68 GB
xet
Upload folder using huggingface_hub
11 months ago
quant_log.csv
8.08 kB
Upload folder using huggingface_hub
11 months ago
quantize_config.json
427 Bytes
Upload folder using huggingface_hub
11 months ago
requirements-lambda-quant.txt
1.6 kB
Upload folder using huggingface_hub
11 months ago
special_tokens_map.json
340 Bytes
Upload folder using huggingface_hub
11 months ago
tokenizer.json
17.2 MB
xet
Upload folder using huggingface_hub
11 months ago
tokenizer_config.json
54.6 kB
Upload folder using huggingface_hub
11 months ago