Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

68

Base only

Active filters: cerebras

barozp/Qwen3.6-28B-REAP20-A3B-GGUF

Text Generation • 28B • Updated Apr 19 • 21k • 51

SebastianSchramm/Cerebras-GPT-111M-instruction

Text Generation • 0.1B • Updated Nov 28, 2023 • 11 • 3

cerebras/Llama3-DocChat-1.0-8B

Text Generation • Updated Aug 16, 2024 • 19 • • 69

NikolayKozloff/Llama3-DocChat-1.0-8B-Q8_0-GGUF

Text Generation • 8B • Updated Aug 21, 2024 • 5 • 6

mattritchey/Llama3-DocChat-1.0-8B-IQ4_NL-GGUF

Text Generation • 8B • Updated Aug 22, 2024 • 8

mattritchey/Llama3-DocChat-1.0-8B-Q4_K_M-GGUF

Text Generation • 8B • Updated Aug 22, 2024

QuantFactory/Llama3-DocChat-1.0-8B-GGUF

Text Generation • 8B • Updated Aug 24, 2024 • 115 • 1

bartowski/Llama3-DocChat-1.0-8B-GGUF

Text Generation • 8B • Updated Aug 30, 2024 • 109

mradermacher/Llama3-DocChat-1.0-8B-GGUF

8B • Updated Jan 22, 2025 • 18 • 1

mradermacher/Llama3-DocChat-1.0-8B-i1-GGUF

8B • Updated Jan 22, 2025 • 53 • 1

cerebras/Llama-3-CBHybridL-8B

Text Generation • 8B • Updated Mar 26, 2025 • 7

MatteoKhan/Cerebras-OPT-Fusion

Text Generation • 7B • Updated Apr 10, 2025 • 5

cerebras/Llama-3-CBHybridM-8B

Text Generation • 8B • Updated Mar 26, 2025 • 4

mradermacher/Cerebras-OPT-Fusion-GGUF

7B • Updated Mar 5, 2025 • 35

mradermacher/Cerebras-OPT-Fusion-i1-GGUF

7B • Updated Mar 5, 2025 • 99

mradermacher/Cerebras-GPT-111M-instruction-GGUF

0.1B • Updated Jul 11, 2025 • 34

mradermacher/Cerebras-GPT-111M-instruction-i1-GGUF

0.1B • Updated Jul 11, 2025 • 86 • 1

0xSero/GLM-4.6-218B-W4A16

Text Generation • 2B • Updated May 30 • 22 • 8

0xSero/GLM-4.7-REAP-40-W4A16

Text Generation • 2B • Updated May 30 • 28 • 7

0xSero/GLM-4.7-185B

Text Generation • 185B • Updated May 30 • 46 • 19

0xSero/GLM-4.7-185B-W4A16

Text Generation • 2B • Updated May 30 • 248 • 69

0xSero/GLM-4.7-202B

Text Generation • 202B • Updated May 30 • 15 • 2

0xSero/DeepSeek-V3.2-345B-W3A16

Text Generation • 2B • Updated May 30 • 28 • 10

mlx-community/GLM-4.7-REAP-50-mixed-3-4-bits

Text Generation • 185B • Updated Jan 4 • 213 • 3

bullerwins/MiniMax-M2.1-REAP-50-GGUF

Text Generation • 116B • Updated Jan 5 • 11 • 1

mradermacher/MiniMax-M2.1-REAP-50-GGUF

116B • Updated Jan 11 • 32 • 5

dolaloichua/GLM-4.7-REAP-50-mlx-4Bit

Text Generation • 185B • Updated Jan 6 • 47

Jon-Nielsen/GLM-4.7-REAP-30-W4A16

Text Generation • 2B • Updated Jan 8 • 4 • 2

AlexGS74/MiniMax-M2.1-REAP-50-mlx-4bit

Text Generation • 116B • Updated Jan 8 • 41 • 2

scaryrawr/GLM-4.7-REAP-50-mlx-3Bit

Text Generation • 185B • Updated Jan 8 • 53