Models: 123

Active filters: ik_llama.cpp

ubergarm/Ling-1T-GGUF

Text Generation • Updated Oct 29, 2025 • 72 • 12

ubergarm/Kimi-K2-Thinking-GGUF

Text Generation • Updated Nov 20, 2025 • 317 • 22

ubergarm/GigaChat3-10B-A1.8B-GGUF

Text Generation • Updated Nov 21, 2025 • 60 • 13

sokann/GLM-4.5-GGUF-3.2162bpw

358B • Updated Nov 29, 2025 • 3

sokann/GLM-4.6-GGUF-3.2263bpw

357B • Updated Nov 30, 2025 • 1

ubergarm/Devstral-2-123B-Instruct-2512-GGUF

Text Generation • Updated Dec 10, 2025 • 8 • 7

ubergarm/GLM-4.7-GGUF

Text Generation • 358B • Updated Dec 24, 2025 • 504 • 25

ubergarm/Devstral-Small-2-24B-Instruct-2512-GGUF

Text Generation • 24B • Updated Dec 25, 2025 • 13 • 1

ubergarm/MiMo-V2-Flash-GGUF

Text Generation • 309B • Updated Jan 6 • 14 • 4

ubergarm/DeepSeek-V3.2-Speciale-GGUF

Text Generation • 671B • Updated Jan 9 • 97 • 18

ubergarm/GLM-4.7-Flash-GGUF

Text Generation • 30B • Updated Jan 21 • 159 • 24

ubergarm/Kimi-K2.5-GGUF

Text Generation • 1T • Updated Feb 6 • 26 • 11

ubergarm/Step-3.5-Flash-GGUF

Text Generation • 197B • Updated Feb 16 • 152 • 42

ubergarm/MiniMax-M2.5-GGUF

Text Generation • 229B • Updated Feb 15 • 984 • 51

ubergarm/GLM-5-GGUF

Text Generation • 754B • Updated Feb 15 • 190 • 16

ubergarm/Qwen3.5-397B-A17B-GGUF

Text Generation • 396B • Updated Apr 3 • 526 • 37

ubergarm/Qwen3-Coder-Next-GGUF

Text Generation • 80B • Updated Feb 26 • 386 • 21

ubergarm/Qwen3.5-122B-A10B-GGUF

Text Generation • 122B • Updated Mar 20 • 2.85k • 20

ubergarm/Qwen3.5-35B-A3B-GGUF

Text Generation • 35B • Updated Mar 16 • 342 • 10

ubergarm/Qwen3.5-27B-GGUF

Text Generation • 27B • Updated Apr 24 • 130 • 18

sokann/GLM-5-GGUF-1.594bpw

754B • Updated Mar 1 • 48 • 3

sokann/Qwen3.5-27B-GGUF-4.165bpw

27B • Updated Mar 13 • 30 • 9

KeinNiemand/Qwen3.5-122B-A10B-IK_GGUF

Text Generation • 122B • Updated Mar 13 • 20 • 1

Uninformed/GLM-4.7-Architect-355B-A32B-GGUF

Text Generation • Updated Mar 13 • 51 • 2

sokann/Qwen3.5-27B-GGUF-4.915bpw

27B • Updated Mar 13 • 12 • 6

sokann/Qwen3.5-27B-GGUF-4.151bpw

27B • Updated Mar 13 • 19 • 1

ubergarm/GLM-5.1-GGUF

Text Generation • 754B • Updated Apr 9 • 490 • 25

AlexanderKyng/Mistral-Small-119B-2603-ik-GGUF

Text Generation • 119B • Updated Apr 11 • 36

sokann/GLM-5.1-GGUF-1.673bpw

754B • Updated Apr 12 • 302 • 4

ubergarm/MiniMax-M2.7-GGUF

Text Generation • 229B • Updated Apr 15 • 381 • 21