Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
QuantTrio
/
KAT-V1-40B-GPTQ-Int4-Int8Mix
like
0
Follow
QuantTrio
246
Text Generation
Transformers
Safetensors
qwen2
AWQ
量化修复
vLLM
conversational
text-generation-inference
4-bit precision
gptq
arxiv:
2507.08297
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
KAT-V1-40B-GPTQ-Int4-Int8Mix
Commit History
Delete .mv
7965360
verified
JunHowie
commited on
Sep 5, 2025
Delete .msc
67e7669
verified
JunHowie
commited on
Sep 5, 2025
Delete .mdl
a1133f6
verified
JunHowie
commited on
Sep 5, 2025
Delete .ipynb_checkpoints
bbe7c95
verified
JunHowie
commited on
Sep 5, 2025
Upload folder using huggingface_hub
12b36ba
verified
JunHowie
commited on
Jul 31, 2025
initial commit
49a5453
verified
JunHowie
commited on
Jul 31, 2025