Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
RedHatAI
/
quantization
like
6
Follow
Red Hat AI
2.07k
kernel
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
3
59b2fef
quantization
Ctrl+K
Ctrl+K
2 contributors
History:
32 commits
danieldk
HF Staff
Sync capabilities with upstream
59b2fef
12 months ago
build
Build
12 months ago
compressed_tensors
Sync with vLLM
about 1 year ago
core
Sync with vLLM
about 1 year ago
cutlass_extensions
Sync with vLLM
about 1 year ago
cutlass_w8a8
Sync with vLLM
about 1 year ago
fp8
Sync with vLLM
about 1 year ago
gptq_marlin
Sync with vLLM
about 1 year ago
marlin
Add full Marlin support and tests for Marlin/CUTLASS
over 1 year ago
tests
Add full Marlin support and tests for Marlin/CUTLASS
over 1 year ago
torch-ext
Add support for ROCm
12 months ago
.gitattributes
Safe
1.56 kB
Build
over 1 year ago
LICENSE
Safe
11.4 kB
Add cutlass_w8a8
over 1 year ago
README.md
Safe
195 Bytes
Update README.md (#1)
about 1 year ago
build.toml
3.25 kB
Sync capabilities with upstream
12 months ago
dispatch_utils.h
Safe
1.49 kB
Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm`
over 1 year ago
flake.lock
3.03 kB
Add support for ROCm
12 months ago
flake.nix
Safe
335 Bytes
Add support for ROCm
12 months ago
vectorization.cuh
Safe
778 Bytes
Sync with vLLM
about 1 year ago