Active filters: 2-bit
irish-quant/Qwen-Qwen3-14B-2bit
15B • Updated • 1
irish-quant/Qwen-Qwen3-32B-2bit
33B • Updated • 2
irish-quant/Qwen-Qwen3-4B-2bit
4B • Updated • 4
irish-quant/Qwen-Qwen3-8B-2bit
8B • Updated • 1
nafisabdkhan/DeepSeek-Coder-V2-Instruct-mlx-2Bit
22B • Updated • 129
mlx-community/GLM-4.5-Air-2bit
Text Generation
• 107B • Updated • 196
• 4
isetnefret/Hermes-3-Llama-3.1-70B-mlx-2Bit
7B • Updated • 37
isetnefret/Strawberrylemonade-L3-70B-v1.1-mlx-2Bit
Text Generation
• 7B • Updated • 8
mlx-community/GLM-4.5-Air-2bit-DWQ
Text Generation
• 107B • Updated • 33
• 2
DreamsOfControl/Llama-3SOME-8B-v2-mlx-2Bit
0.8B • Updated • 6
DreamsOfControl/Moistral-11B-v3-mlx-2Bit
1B • Updated • 11
monirmamoun/GLM-4.5-MLX-2bit
Text Generation
• 353B • Updated • 10
derek-wh/Qwen3-235B-A22B-Instruct-2507-mlx-2Bit
Text Generation
• 235B • Updated • 43
huseyincavus/gemma-3-270m-it-mlx-2Bit
Text Generation
• 40.9M • Updated • 4
mrtoots/unsloth-GLM-4.5-MLX-2Bit
Text Generation
• 353B • Updated • 20
OPEA/DeepSeek-R1-0528-int2-mixed-AutoRound
56B • Updated • 2
MaziyarPanahi/Qwen3-4B-Instruct-2507-GGUF
Text Generation
• 4B • Updated • 209k
• 2
MaziyarPanahi/Qwen3-30B-A3B-Instruct-2507-GGUF
Text Generation
• 31B • Updated • 111k
• 4
Matt300209/1B-instruct-int2
1B • Updated • 1
Matt300209/1B-tulu-sft-int2
1B • Updated • 1
Matt300209/8B-instruct-int2
8B • Updated
Matt300209/8B-tulu-sft-int2
8B • Updated • 2
genai-archive/Qwen3-30B-A3B-Instruct-2507-MLX-Q2
Text Generation
• 31B • Updated • 8
Text Generation
• 0.5B • Updated • 3
Text Generation
• 0.5B • Updated • 1
1B • Updated • 1
jesusoctavioas/gpt-oss-120b-mlx-2Bit
Text Generation
• 117B • Updated • 315
• 2
mrtoots/unsloth-Hermes-4-405B-mlx-2Bit
Text Generation
• 406B • Updated • 57
mrtoots/Hermes-3-Llama-3.1-405B-mlx-2Bit
Text Generation
• 406B • Updated • 62
ncard/Qwen3-30B-A3B-Thinking-2507-mlx-2Bit
Text Generation
• 31B • Updated • 4