spicyneuron commited on
Commit
0acbdfd
·
verified ·
1 Parent(s): e98b034

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -67,7 +67,7 @@ Note:
67
 
68
  Quantized with a [mlx-lm fork](https://github.com/ml-explore/mlx-lm/pull/922),
69
  drawing inspiration from Unsloth/AesSedai/ubergarm style mixed-precision GGUFs.
70
- MLX quantization options differ than llama.cpp, but the principles are the
71
  same:
72
 
73
  - Sensitive layers like MoE routing, attention, and output embeddings get higher precision
 
67
 
68
  Quantized with a [mlx-lm fork](https://github.com/ml-explore/mlx-lm/pull/922),
69
  drawing inspiration from Unsloth/AesSedai/ubergarm style mixed-precision GGUFs.
70
+ MLX quantization options differ from llama.cpp, but the principles are the
71
  same:
72
 
73
  - Sensitive layers like MoE routing, attention, and output embeddings get higher precision