All quants supported by mmq
https://github.com/ggml-org/llama.cpp/blob/e86f3c22211d9b5c3842e2961a022aac9cdbacad/ggml/src/ggml-cuda/mmq.cu#L269-L294
For measuring cublas/rocblas vs mmq perf