Inference Providers
Active filters: cerebras
barozp/Qwen3.6-28B-REAP20-A3B-GGUF
Text Generation
• 28B • Updated • 21k
• 51
SebastianSchramm/Cerebras-GPT-111M-instruction
Text Generation
• 0.1B • Updated • 11
• 3
cerebras/Llama3-DocChat-1.0-8B
Text Generation
• Updated • 19
• • 69
NikolayKozloff/Llama3-DocChat-1.0-8B-Q8_0-GGUF
Text Generation
• 8B • Updated • 5
• 6
mattritchey/Llama3-DocChat-1.0-8B-IQ4_NL-GGUF
Text Generation
• 8B • Updated • 8
mattritchey/Llama3-DocChat-1.0-8B-Q4_K_M-GGUF
Text Generation
• 8B • Updated QuantFactory/Llama3-DocChat-1.0-8B-GGUF
Text Generation
• 8B • Updated • 115
• 1
bartowski/Llama3-DocChat-1.0-8B-GGUF
Text Generation
• 8B • Updated • 109
mradermacher/Llama3-DocChat-1.0-8B-GGUF
8B • Updated • 18
• 1
mradermacher/Llama3-DocChat-1.0-8B-i1-GGUF
8B • Updated • 53
• 1
cerebras/Llama-3-CBHybridL-8B
Text Generation
• 8B • Updated • 7
MatteoKhan/Cerebras-OPT-Fusion
Text Generation
• 7B • Updated • 5
cerebras/Llama-3-CBHybridM-8B
Text Generation
• 8B • Updated • 4
mradermacher/Cerebras-OPT-Fusion-GGUF
7B • Updated • 35
mradermacher/Cerebras-OPT-Fusion-i1-GGUF
7B • Updated • 99
mradermacher/Cerebras-GPT-111M-instruction-GGUF
0.1B • Updated • 34
mradermacher/Cerebras-GPT-111M-instruction-i1-GGUF
0.1B • Updated • 86
• 1
0xSero/GLM-4.6-218B-W4A16
Text Generation
• 2B • Updated • 22
• 8
0xSero/GLM-4.7-REAP-40-W4A16
Text Generation
• 2B • Updated • 28
• 7
Text Generation
• 185B • Updated • 46
• 19
0xSero/GLM-4.7-185B-W4A16
Text Generation
• 2B • Updated • 248
• 69
Text Generation
• 202B • Updated • 15
• 2
0xSero/DeepSeek-V3.2-345B-W3A16
Text Generation
• 2B • Updated • 28
• 10
mlx-community/GLM-4.7-REAP-50-mixed-3-4-bits
Text Generation
• 185B • Updated • 213
• 3
bullerwins/MiniMax-M2.1-REAP-50-GGUF
Text Generation
• 116B • Updated • 11
• 1
mradermacher/MiniMax-M2.1-REAP-50-GGUF
116B • Updated • 32
• 5
dolaloichua/GLM-4.7-REAP-50-mlx-4Bit
Text Generation
• 185B • Updated • 47
Jon-Nielsen/GLM-4.7-REAP-30-W4A16
Text Generation
• 2B • Updated • 4
• 2
AlexGS74/MiniMax-M2.1-REAP-50-mlx-4bit
Text Generation
• 116B • Updated • 41
• 2
scaryrawr/GLM-4.7-REAP-50-mlx-3Bit
Text Generation
• 185B • Updated • 53