Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
lmdeploy
/
llama2-chat-7b-w4
like
3
Follow
lmdeploy
18
Text Generation
Transformers
PyTorch
llama
text-generation-inference
License:
llama2
Model card
Files
Files and versions
xet
Community
2
Deploy
Use this model
refs/pr/2
llama2-chat-7b-w4
7.81 GB
1 contributor
History:
7 commits
SFconvertbot
Adding `safetensors` variant of this model
ed554a3
verified
12 months ago
.gitattributes
1.52 kB
initial commit
over 2 years ago
README.md
4.49 kB
Update README.md
over 2 years ago
README_zh-CN.md
3.28 kB
Add README_zh-CN
over 2 years ago
config.json
645 Bytes
Upload AWQ weights and qparams
over 2 years ago
generation_config.json
132 Bytes
Upload AWQ weights and qparams
over 2 years ago
inputs_stats.pth
11.7 MB
xet
Upload AWQ weights and qparams
over 2 years ago
key_stats.pth
813 kB
xet
Upload AWQ weights and qparams
over 2 years ago
model.safetensors
3.89 GB
xet
Adding `safetensors` variant of this model
12 months ago
outputs_stats.pth
16.7 MB
xet
Upload AWQ weights and qparams
over 2 years ago
pytorch_model.bin
3.89 GB
xet
Upload AWQ weights and qparams
over 2 years ago
special_tokens_map.json
411 Bytes
Upload AWQ weights and qparams
over 2 years ago
tokenizer.model
500 kB
xet
Upload AWQ weights and qparams
over 2 years ago
tokenizer_config.json
745 Bytes
Upload AWQ weights and qparams
over 2 years ago
value_stats.pth
813 kB
xet
Upload AWQ weights and qparams
over 2 years ago