
RedHatAI/quantization


533 MB

2 contributors

History: 6 commits

Latest commit: "Build" by danieldk (HF Staff), c5018b2, over 1 year ago

Name                Size      Last commit                                         Updated
build               -         Build                                               over 1 year ago
compressed_tensors  -         Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm`  over 1 year ago
core                -         Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm`  over 1 year ago
cutlass_extensions  -         Add cutlass_w8a8                                    over 1 year ago
cutlass_w8a8        -         Add cutlass_w8a8                                    over 1 year ago
ext-torch           -         Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm`  over 1 year ago
fp8                 -         Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm`  over 1 year ago
gptq_marlin         -         Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm`  over 1 year ago
.gitattributes      1.56 kB   Build                                               over 1 year ago
LICENSE             11.4 kB   Add cutlass_w8a8                                    over 1 year ago
README.md           181 Bytes Fixup metadata                                      over 1 year ago
build.toml          1.78 kB   Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm`  over 1 year ago
dispatch_utils.h    1.49 kB   Add `scaled_(int|fp8)_quant` and `fp8_marlin_gemm`  over 1 year ago