MERaLiON-2-3B-RotorQuant-MLX-4bit / generation_config.json

Commit History

Add MLX quantized model with KV cache compression
1785a33
verified

majentik committed on