Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Efficient-ML
/
LLaMA-3-8B-QuIP-2bit
like
3
Follow
Efficient Intelligence and Systems
34
Text Generation
Transformers
PyTorch
llama
conversational
text-generation-inference
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
refs/pr/1
LLaMA-3-8B-QuIP-2bit
32.1 GB
2 contributors
History:
3 commits
SFconvertbot
Adding `safetensors` variant of this model
d8c470e
verified
over 1 year ago
.gitattributes
1.52 kB
initial commit
almost 2 years ago
config.json
733 Bytes
Upload LLaMA-3-8B-QuIP-2bit
almost 2 years ago
generation_config.json
121 Bytes
Upload LLaMA-3-8B-QuIP-2bit
almost 2 years ago
model-00001-of-00002.safetensors
9.98 GB
xet
Adding `safetensors` variant of this model
over 1 year ago
model-00002-of-00002.safetensors
6.08 GB
xet
Adding `safetensors` variant of this model
over 1 year ago
model.safetensors.index.json
25.1 kB
Adding `safetensors` variant of this model
over 1 year ago
pytorch_model-00001-of-00002.bin
9.98 GB
xet
Upload LLaMA-3-8B-QuIP-2bit
almost 2 years ago
pytorch_model-00002-of-00002.bin
6.08 GB
xet
Upload LLaMA-3-8B-QuIP-2bit
almost 2 years ago
pytorch_model.bin.index.json
24 kB
Upload LLaMA-3-8B-QuIP-2bit
almost 2 years ago
special_tokens_map.json
73 Bytes
Upload LLaMA-3-8B-QuIP-2bit
almost 2 years ago
tokenizer.json
9.08 MB
Upload LLaMA-3-8B-QuIP-2bit
almost 2 years ago
tokenizer_config.json
50.9 kB
Upload LLaMA-3-8B-QuIP-2bit
almost 2 years ago