Qwen-3.5-unsloth-mlx Collection AWQ-style pre-scaling using Unsloth's imatrix calibration data, then 3-6-bit affine quantization with the Unsloth mixed-precision recipe via MLX • 20 items • Updated about 2 hours ago • 10
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 Feb 20 • 490
MS3.2-PaintedFantasy-v4-24B MLX Collection MLX Quants of zerofata's MS3.2-PaintedFantasy-v4-24B • 6 items • Updated 23 days ago • 1
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 30 items • Updated Feb 25 • 133
💧 LFM2.5 Collection Collection of Instruct, Base, and Japanese LFM2.5-1.2B models. • 22 items • Updated 15 days ago • 107