majentik
Add MLX 8-bit quantized model with KV cache compression
24a37dc verified