Active filters: 2-bit
shubhamg2208/tomoro-ai-colqwen3-embed-4b-auto-round-w2a16g32
Matt300209/autoround_test • 1B • Updated • 1
mradermacher/Fairy2i-W2-GGUF • Text Generation • 7B • Updated • 97
mradermacher/Fairy2i-W2-i1-GGUF • Text Generation • 7B • Updated • 270
alexgusevski/Ministral-3-3B-Instruct-2512-q2-mlx • Text Generation • 0.3B • Updated • 8
alexgusevski/Ministral-3-3B-Reasoning-2512-q2-mlx • Text Generation • 0.3B • Updated • 9
alexgusevski/Ministral-3-8B-Instruct-2512-q2-mlx • Text Generation • 0.8B • Updated • 8
alexgusevski/Ministral-3-8B-Reasoning-2512-q2-mlx • Text Generation • 0.8B • Updated • 4
alexgusevski/Lightning-1.7B-q2-mlx • Text Generation • 0.2B • Updated • 2
mlx-community/YandexGPT-5-Lite-8B-instruct-q2 • Text Generation • 0.8B • Updated • 78 • 2
garrison/Magidonia-24B-v4.3-mlx-2Bit • 24B • Updated • 8
garrison/Cydonia-24B-v4.3-mlx-2Bit • 24B • Updated • 192 • 1
INC4AI/Qwen3-8B-w2g64-AutoRound-test • 2B • Updated • 5 • 1
Wwayu/GLM-4.7-PRISM-mlx-2Bit • Text Generation • 353B • Updated • 670 • 4
SiddhJagani/Nemotron-Cascade-14B-Thinking-Claude-4.5-Opus-High-Reasoning-Distill-mlx-Q2 • Text Generation • 1B • Updated • 36
cosmicgob/Qwen3-30B-A3B-Instruct-2507-mlx-2Bit • Text Generation • 31B • Updated • 27
cosmicgob/Qwen3-0.6B-mlx-2Bit • Text Generation • 55.9M • Updated • 16 • 1
AiAF/Midnight-Miqu-70B-v1.5-MLX-2Bit • Text Generation • 69B • Updated • 65
AiAF/rp-sft-merged_1000-MLX-2Bit • Text Generation • 0.3B • Updated • 13
Eldadalbajob/GLM-4.7-REAP-50-mlx-2Bit • Text Generation • 185B • Updated • 124 • 2
alexgusevski/LFM2.5-1.2B-Nova-Function-Calling-q2-mlx • Text Generation • 0.1B • Updated • 33
alexgusevski/LFM2.5-VL-1.6B-q2-mlx • Image-Text-to-Text • 0.5B • Updated • 9
alexgusevski/Youtu-LLM-2B-q2-mlx • Text Generation • 0.2B • Updated • 2 • 1
alexgusevski/HY-MT1.5-1.8B-q2-mlx • Text Generation • 0.2B • Updated • 69
alexgusevski/Huihui-HY-MT1.5-7B-abliterated-q2-mlx • Text Generation • 0.7B • Updated • 16
alexgusevski/Huihui-Qwen3-VL-4B-Instruct-abliterated-q2-mlx • Image-Text-to-Text • 0.8B • Updated • 25
ParetoQaft/8B-Tulu-full-gptq-2bit
ParetoQaft/1B-Tulu-full-gptq-2bit
Matt300209/autoround_style_unmerged • 0.4B • Updated • 1
alexgusevski/Falcon-H1R-7B-q2-mlx • Text Generation • 0.7B • Updated • 7