majentik's picture
Add MLX 2-bit quantized model with KV cache compression
0f832ed verified