Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

RedHatAI
/

quantization

Model card Files Files and versions

1.05 GB

Ctrl+K

Ctrl+K

2 contributors

History: 34 commits

danieldk's picture

danieldk HF Staff

Build

cfc95fb about 1 year ago

build
Build about 1 year ago
compressed_tensors
Sync with vLLM over 1 year ago
core
Sync with vLLM over 1 year ago
cutlass_extensions
Sync with vLLM over 1 year ago
cutlass_w8a8
Sync with vLLM over 1 year ago
fp8
Sync with vLLM over 1 year ago
gptq_marlin
Sync with vLLM over 1 year ago
marlin
Add full Marlin support and tests for Marlin/CUTLASS over 1 year ago
tests
Add full Marlin support and tests for Marlin/CUTLASS over 1 year ago
torch-ext
Add support for ROCm about 1 year ago
.gitattributes

1.56 kB
Build over 1 year ago
LICENSE

11.4 kB
Add cutlass_w8a8 over 1 year ago
README.md

195 Bytes
Update README.md (#1) over 1 year ago
build.toml

3.25 kB
Sync capabilities with upstream about 1 year ago
dispatch_utils.h

1.49 kB
Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm` over 1 year ago
flake.lock

3.03 kB
Update flake about 1 year ago
flake.nix

335 Bytes
Add support for ROCm about 1 year ago
vectorization.cuh

778 Bytes
Sync with vLLM over 1 year ago