How to use kernels-community/quantization-gptq with Kernels:
# !pip install kernels
from kernels import get_kernel
kernel = get_kernel("kernels-community/quantization-gptq")
An example would be helpful. Is absmax the maximum value rather than the scale? Also, will you provide a Marlin kernel?
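To make the absmax-versus-scale distinction in this question concrete, here is a minimal pure-Python sketch of generic symmetric absmax quantization. It is an assumption-laden illustration of the common convention, not this kernel's API or its confirmed behavior: the function names `absmax_quantize` and `dequantize`, the `bits` parameter, and the sample weights are all hypothetical.

```python
# Generic symmetric "absmax" quantization sketch.
# NOTE: hypothetical helper names; this does NOT call or describe the
# kernels-community/quantization-gptq kernel itself.

def absmax_quantize(values, bits=4):
    """Quantize a group of floats to signed integers using its absmax."""
    qmax = 2 ** (bits - 1) - 1                     # 7 for signed 4-bit
    absmax = max(abs(v) for v in values)           # largest magnitude in the group
    scale = absmax / qmax if absmax > 0 else 1.0   # step size derived from absmax
    q = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return q, absmax, scale


def dequantize(q, scale):
    """Map quantized integers back to approximate float values."""
    return [qi * scale for qi in q]


weights = [1.2, -3.5, 0.4, 0.0]
q, absmax, scale = absmax_quantize(weights)
restored = dequantize(q, scale)

print("absmax:", absmax)      # the maximum absolute value
print("scale:", scale)        # absmax divided by the largest integer level
print("quantized:", q)
print("restored:", restored)
```

Under this convention the two quantities are related but not interchangeable: absmax is the largest magnitude in the group, while the scale is absmax divided by the largest representable integer level. Which of the two a given kernel expects as input has to be confirmed from its own documentation or source.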