keyfan/Qwen-72B-Chat-2bit
Text Generation
Transformers
PyTorch
qwen
custom_code
QUiP
License: qianwen
Quantization device
3
#3 opened about 2 years ago by tiantian7777
Is there a big performance difference between 2-bit and 4-bit quantization in conversations?
1
#2 opened about 2 years ago by xldistance