majentik
/

Leanstral-RotorQuant-MLX-2bit

@@ -2,23 +2,20 @@
 base_model: mistralai/Leanstral-2603
 library_name: mlx
 tags:
-- rotorquant
-- kv-cache-quantization
-- mlx
-- 2-bit
-- weight-quantization
-- leanstral
-- lean4
-- formal-proofs
-- theorem-proving
-- quantized
-- apple-silicon
-- mistral
-- moe
 license: apache-2.0
-pipeline_tag: text-generation
-language:
-- en
 ---
 # Leanstral-RotorQuant-MLX-2bit
@@ -95,3 +92,31 @@ Leanstral excels at:
 - [majentik/Leanstral-RotorQuant-MLX-1bit](https://huggingface.co/majentik/Leanstral-RotorQuant-MLX-1bit) -- MLX 1-bit + RotorQuant
 - [majentik/Leanstral-TurboQuant-MLX-2bit](https://huggingface.co/majentik/Leanstral-TurboQuant-MLX-2bit) -- MLX 2-bit + TurboQuant
 - [RotorQuant repository](https://github.com/scrya-com/rotorquant)

 base_model: mistralai/Leanstral-2603
 library_name: mlx
 tags:
+  - rotorquant
+  - kv-cache-quantization
+  - mlx
+  - 2-bit
+  - weight-quantization
+  - leanstral
+  - lean4
+  - formal-proofs
+  - theorem-proving
+  - quantized
+  - apple-silicon
+  - mistral
+  - moe
 license: apache-2.0
 ---
 # Leanstral-RotorQuant-MLX-2bit
 - [majentik/Leanstral-RotorQuant-MLX-1bit](https://huggingface.co/majentik/Leanstral-RotorQuant-MLX-1bit) -- MLX 1-bit + RotorQuant
 - [majentik/Leanstral-TurboQuant-MLX-2bit](https://huggingface.co/majentik/Leanstral-TurboQuant-MLX-2bit) -- MLX 2-bit + TurboQuant
 - [RotorQuant repository](https://github.com/scrya-com/rotorquant)
+## Quant trade-off (MLX lane)
+| Bits | Approx size | Use case | Recommendation |
+|---|---|---|---|
+| **2-bit** | ~31 GB | Aggressive quantization | **Very low-RAM Macs** |
+| 3-bit | ~43 GB | Lossy but small | Low-RAM Macs |
+| 4-bit | ~50 GB | Balanced default | Recommended for most Macs |
+| 5-bit | ~60 GB | Higher fidelity | Quality-sensitive |
+| 6-bit | ~71 GB | Approaching FP16 quality | High-fidelity |
+| 8-bit | ~90 GB | Near-lossless reference | Fidelity-critical work |
+(Current variant — **2bit** — is bolded.)
+## Variants in this family
+(Showing 8 sibling variants under `majentik/leanstral-*`. The current variant — `RotorQuant-MLX-2bit` — is **bolded**.)
+| Variant | Runtime | Approx size | Use case |
+|---|---|---|---|
+| [RotorQuant](https://huggingface.co/majentik/leanstral-rotorquant) | runtime modifier | n/a | KV-cache root (weight-agnostic) |
+| **RotorQuant-MLX-2bit** | mlx-lm | card-only | Apple Silicon, smallest |
+| [RotorQuant-MLX-4bit](https://huggingface.co/majentik/leanstral-rotorquant-mlx-4bit) | mlx-lm | card-only | Apple Silicon balanced |
+| [RotorQuant-MLX-8bit](https://huggingface.co/majentik/leanstral-rotorquant-mlx-8bit) | mlx-lm | card-only | Apple Silicon reference |
+| [TurboQuant](https://huggingface.co/majentik/leanstral-turboquant) | runtime modifier | n/a | KV-cache root (weight-agnostic) |
+| [TurboQuant-MLX-2bit](https://huggingface.co/majentik/leanstral-turboquant-mlx-2bit) | mlx-lm | card-only | Apple Silicon, smallest |
+| [TurboQuant-MLX-4bit](https://huggingface.co/majentik/leanstral-turboquant-mlx-4bit) | mlx-lm | card-only | Apple Silicon balanced |
+| [TurboQuant-MLX-8bit](https://huggingface.co/majentik/leanstral-turboquant-mlx-8bit) | mlx-lm | card-only | Apple Silicon reference |