Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
RedHatAI
/
quantization
like
6
Follow
Red Hat AI
1.87k
kernel
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
3
70da85f
quantization
3.87 GB
2 contributors
History:
56 commits
danieldk
HF Staff
Sync updates for CUDA 13 compat
70da85f
20 days ago
attention
Sync to vLLM 20250627
7 months ago
build
Build (aarch64-linux)
6 months ago
compressed_tensors
Sync updates for CUDA 13 compat
20 days ago
core
Sync to vLLM 20250627
7 months ago
cutlass_extensions
Sync to vLLM 20250627
7 months ago
cutlass_w8a8
Sync to vLLM 20250627
7 months ago
fp8
Sync updates for CUDA 13 compat
20 days ago
gptq_marlin
Sync to vLLM 20250627
7 months ago
marlin
Sync to vLLM 20250627
7 months ago
tests
Sync to vLLM 20250627
7 months ago
torch-ext
Fix absolute imports
7 months ago
.gitattributes
1.56 kB
Build
about 1 year ago
LICENSE
11.4 kB
Add cutlass_w8a8
about 1 year ago
README.md
195 Bytes
Update README.md (#1)
12 months ago
build.toml
6 kB
Sync updates for CUDA 13 compat
20 days ago
cub_helpers.h
416 Bytes
Sync updates for CUDA 13 compat
20 days ago
cuda_utils.h
1.41 kB
Sync on vLLM 20240402
10 months ago
dispatch_utils.h
3.9 kB
Sync to vLLM 20250627
7 months ago
flake.lock
2.48 kB
Update flake
21 days ago
flake.nix
335 Bytes
Update flake
21 days ago
utils.cuh
1.84 kB
Sync on vLLM 20240402
10 months ago
vectorization.cuh
878 Bytes
Sync to vLLM 20250627
7 months ago
vectorization_utils.cuh
2.61 kB
Sync to vLLM 20250627
7 months ago