Inference Providers
Active filters: 2-bit
MikeRoz/Meta-Llama-3.1-405B-Instruct-2.0bpw-h4-exl3
Text Generation
• 53B • Updated • 3
• 1
sleepdeprived3/Baptist-Christian-Bible-Expert-v2.0-24B_EXL2_2bpw_H8
Text Generation
• Updated • 3
phires/Llama-3.2-1B-Instruct-GGUF-rk3588-1.1.2
Text Generation
• Updated MaziyarPanahi/GLM-4-32B-0414-GGUF
Text Generation
• 33B • Updated • 36
• 1
MaziyarPanahi/cogito-v1-preview-llama-3B-GGUF
Text Generation
• 4B • Updated • 74
• 1
MaziyarPanahi/cogito-v1-preview-llama-8B-GGUF
Text Generation
• 8B • Updated • 49
• 1
MaziyarPanahi/cogito-v1-preview-llama-70B-GGUF
Text Generation
• 71B • Updated • 57
• 1
Erland/softpick-340M-4096-model-GPTQ-2bit
Text Generation
• 0.4B • Updated • 2
Erland/vanilla-340M-4096-model-GPTQ-2bit
Text Generation
• 0.4B • Updated • 1
kaitchup/Qwen2.5-72B-Instruct-autoround-2bit-32g-2048-gptq
73B • Updated • 1
kaitchup/Qwen2.5-72B-Instruct-autoround-2bit-64g-2048-gptq
73B • Updated • 1
kaitchup/Qwen2.5-72B-Instruct-autoround-2bit-128g-2048-gptq
73B • Updated • 1
kaitchup/Qwen2.5-72B-Instruct-autoround-2bit-32g-4096-gptq
73B • Updated • 1
kaitchup/Qwen2.5-72B-Instruct-autoround-2bit-64g-4096-gptq
73B • Updated • 1
kaitchup/Qwen2.5-72B-Instruct-autoround-2bit-128g-4096-gptq
73B • Updated • 1
steampunque/Llama-4-Scout-17B-16E-Instruct-MP-GGUF
Text Generation
• 108B • Updated • 113
• 1
MaziyarPanahi/Qwen3-0.6B-GGUF
Text Generation
• 0.8B • Updated • 294k
• 11
MaziyarPanahi/Qwen3-1.7B-GGUF
Text Generation
• 2B • Updated • 287k
• 7
MaziyarPanahi/Qwen3-8B-GGUF
Text Generation
• 8B • Updated • 288k
• 10
MaziyarPanahi/Qwen3-4B-GGUF
Text Generation
• 4B • Updated • 288k
• 7
MaziyarPanahi/Qwen3-14B-GGUF
Text Generation
• 15B • Updated • 288k
• 9
MaziyarPanahi/Qwen3-32B-GGUF
Text Generation
• 33B • Updated • 281k
• 2
MaziyarPanahi/Qwen3-30B-A3B-GGUF
Text Generation
• 31B • Updated • 282k
• 4
16B • Updated kaitchup/Qwen3-32B-autoround-2bit-gptq
33B • Updated • 7.52k
• 1
Cozmicalz/Lycanthropic-Thoughts-32B-mlx-2Bit
3B • Updated • 3
kaitchup/Qwen3-14B-autoround-2bit-gptq
15B • Updated • 430
kaitchup/Qwen3-8B-autoround-2bit-gptq
8B • Updated • 536
DanyDA/Qwen3-32B-exl3-2.0bpw
Text Generation
• 5B • Updated • 1
Siddharth63/Qwen3-8B-Base-2bits-AutoRound-GPTQ-sym
8B • Updated • 11