gemma-4-31B-it-RotorQuant-MLX-2bit / tokenizer_config.json

Commit History

Add MLX quantized model with KV cache compression
4c89218
verified

majentik committed on