Ornith-1.0 Collection Ornith-1.0 is a family of open-source LLMs specialized for agentic coding. • 8 items • Updated 6 days ago • 289
Gemma 4 QAT Collection Gemma 4 QAT (Quantization-Aware Training) for 3x lower memory use and near-original accuracy. • 16 items • Updated 18 days ago • 98
Qwen 3.x MTP Collection MLX MTP drafter checkpoints for Qwen 3.x speculative decoding with mlx-vlm. • 12 items • Updated Jun 1 • 9
Gemma-4-31B-IT-unsloth-mlx Collection Gemma-4-31B-IT (dense vision-language) quantized for Apple Silicon (MLX) — Unsloth Dynamic 2.0 with AWQ imatrix pre-scaling. • 9 items • Updated May 16 • 1
Gemma-4-26B-IT-unsloth-mlx Collection Gemma-4-26B-A4B-IT MoE quantized for Apple Silicon (MLX) — Unsloth Dynamic 2.0 with AWQ imatrix pre-scaling. • 8 items • Updated May 16 • 1
Qwen-3.6-unsloth-mlx Collection AWQ-style pre-scaling using Unsloth's imatrix calibration data, then 3-6-bit affine quantization with the Unsloth mixed-precision recipe via MLX • 18 items • Updated May 15 • 19
Qwen3.5 Collection Qwen3.5 is Qwen's new model family including Qwen3.5 Small: 0.8B, 2B, 4B, 9B and Qwen3.5 Medium: 35B-A3B, 27B, 122B-A10B and 397B-A17B. • 25 items • Updated 18 days ago • 161
Qwen3.5-122B-A10B Collection MINT quantized versions of Qwen3.5-122B-A10B at multiple budget targets (MLX & GGUF) • 4 items • Updated Apr 7 • 2
APEX Quants (GGUF) Collection MoE models quantized with the APEX Quantization technique ( https://github.com/mudler/apex-quant ) • 39 items • Updated 1 day ago • 124
Gemma 4 Collection Gemma 4 is Google's new model family including E2B, E4B, 26B-A4B, and 31B. • 36 items • Updated 18 days ago • 227
Qwen-3.5-unsloth-mlx Collection AWQ-style pre-scaling using Unsloth's imatrix calibration data, then 3-6-bit affine quantization with the Unsloth mixed-precision recipe via MLX • 20 items • Updated Mar 29 • 20
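
Several of the collections above mention "affine quantization" at 3 to 6 bits. As a rough illustration of what that term generally means, here is a minimal sketch in plain Python. It is a generic textbook form of per-group affine quantization, not the MLX implementation and not the Unsloth mixed-precision recipe; the function names `quantize_group` and `dequantize_group` are hypothetical, and the AWQ-style pre-scaling step is not shown.

```python
from typing import List, Tuple


def quantize_group(values: List[float], bits: int) -> Tuple[List[int], float, float]:
    """Map a group of floats to integer codes in [0, 2**bits - 1].

    Returns (codes, scale, bias) so that value ~= code * scale + bias.
    Generic illustration only; names and layout are assumptions.
    """
    levels = (1 << bits) - 1            # highest integer code
    lo, hi = min(values), max(values)
    # Guard against a constant group, where hi == lo would give scale 0.
    scale = (hi - lo) / levels if hi > lo else 1.0
    codes = [round((v - lo) / scale) for v in values]
    return codes, scale, lo


def dequantize_group(codes: List[int], scale: float, bias: float) -> List[float]:
    """Reconstruct approximate floats from integer codes."""
    return [c * scale + bias for c in codes]


if __name__ == "__main__":
    weights = [-1.0, -0.5, 0.0, 0.5, 1.0]
    for bits in (3, 4, 6):
        codes, scale, bias = quantize_group(weights, bits)
        restored = dequantize_group(codes, scale, bias)
        worst = max(abs(w - r) for w, r in zip(weights, restored))
        print(f"{bits}-bit: scale={scale:.4f} worst abs error={worst:.4f}")
```

In this sketch the rounding error per weight is bounded by half the scale, so a higher bit width gives a finer grid over the same range. That is the basic trade-off a mixed-precision recipe balances when it assigns different bit widths to different layers.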