Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

5,358

Base only

Active filters: llama.cpp

rjohal164/coachmode-llama32-1b-vbmi-q4km

1B • Updated Mar 3 • 10

albertlieadrian/qwen3-0.6b-codeforces-sft-gguf

0.8B • Updated Mar 3 • 9

MeridianVector/gemma3-4b-vision-gspo-v6-GGUF

4B • Updated Mar 3 • 129

Ex0bit/Qwen3.5-PRISM-Dynamic-Quant-GGUF

Text Generation • 0.8B • Updated Mar 3 • 757 • 8

daydreamwarrior/Nemotron-Research-GooseReason-4B-Instruct-heretic

4B • Updated Mar 17 • 44 • 1

doncoriolan/sec-summarizer-gemma3-gguf

4B • Updated Mar 3 • 11

ai-colombia/qwen3-job-searcher-gguf

8B • Updated Mar 3 • 15

lolilolikon/deepseek_test

8B • Updated Mar 3 • 70

2796gauravc/notifai-lfm25-1.2b

1B • Updated Mar 3 • 14

Basti83/german-support-qwen-gguf

3B • Updated Mar 3 • 111

11-47/GPT2.5.2-NSFW-Codex-0.4B-GGUF

Text Generation • 0.4B • Updated May 2 • 1.28k • 11

mathensley/lfm2.5-it-ft-v2

1B • Updated Mar 3 • 21

MeridianVector/qwen3-5-4b-vision-gspo-v1-GGUF

4B • Updated Mar 3 • 98

youngoffcial/blink-bc-model-gguf

8B • Updated Mar 3 • 9

hinny/Qwen3.5-4B-GGUF-Q4_K_M

4B • Updated Mar 3 • 71

11-47/GPT5.1-High.Reasoning.Codex-0.4B-GGUF

Text Generation • 0.4B • Updated Mar 21 • 319 • 4

pihu21057w/kks

8B • Updated Mar 3 • 67

chhatramani/qwen3-1.7B_nyayalm_7legaltask_1e_GGUF

2B • Updated Mar 3 • 8

xXGioXx/sad_core_v1_q8

2B • Updated Mar 3 • 12

billylo/medgemma-1.5-4b-it-mmproj-gguf

Updated Mar 3 • 56

Colby/Apertus-8B-openhermes-gguf

8B • Updated Mar 3 • 20

tarball0/ELF-Decompiler-GGUF

8B • Updated Apr 8 • 79

Hack337/Qwen3.5-4B-high-reasoning-GGUF

4B • Updated Mar 3 • 260

WithinUsAI/Qwen3-Qrazy.Qoder-0.6B-GGUF

Text Generation • 0.6B • Updated Mar 21 • 161 • 3

professoorr/cyber-qwen2.5-coder-7b-redteam-gguf

8B • Updated Mar 3 • 135

alibidaran/Qwen_fullstackAssist_GGUF

3B • Updated Mar 3 • 26

rjohal164/coachmode-llama31-8b-vbmi-q4km

8B • Updated Mar 4 • 13

mradermacher/notifai-lfm25-1.2b-GGUF

1B • Updated Mar 4 • 27

smpaz7467/my-custom-coder-qwen

8B • Updated Mar 4 • 12

bambuuai/LFM2.5-1.2B-Instruct-GGUF

Text Generation • 1B • Updated Mar 4 • 42