majentik
Add MLX quantized model with KV cache compression
e0a8d0a verified