RedHatAI/quantization
kernel
License: apache-2.0
Commit History
Add GPTQ-Marlin
c31b5ce
danieldk
HF Staff
committed on
Dec 10, 2024
Build
c5018b2
danieldk
HF Staff
committed on
Dec 9, 2024
Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm`
5c6fb68
danieldk
HF Staff
committed on
Dec 9, 2024
Build
a77838d
danieldk
HF Staff
committed on
Dec 9, 2024
Fixup metadata
c7e38f0
danieldk
HF Staff
committed on
Dec 9, 2024
Add cutlass_w8a8
b4cad21
danieldk
HF Staff
committed on
Dec 9, 2024
initial commit
e87d8e6
verified
danieldk
HF Staff
committed on
Dec 9, 2024