Models

5,364

Active filters: llama.cpp

jsrobinson3/beebuddy-gguf

8B • Updated Mar 1 • 16

mirfan899/gluco-gemma

0.3B • Updated Mar 11 • 11

pryoot/qwen_finetune

4B • Updated Mar 1 • 10

codeslord/ministral-14b-web-agent-gguf

14B • Updated Mar 1 • 138

Rabe3/gemma270-suadi-gguf

0.3B • Updated Mar 1 • 9

jsrobinson3/beebuddy-bee-gguf

8B • Updated Mar 1 • 36

amadews/LFM2_5_code_instructions_120k_alpaca_q4

1B • Updated Mar 2 • 72

Acvarius-AI/GhostLlama

1B • Updated Mar 3 • 6

DylM0nster22/qwen_finetune

4B • Updated Mar 2 • 3

Ellbendls/Omnisec1-4b-Thinking-GGUF

4B • Updated Mar 2 • 16

MeridianVector/gemma3-4b-vision-gspo-GGUF

4B • Updated Mar 2 • 158

moxin-org/Qwen3.5-27B-GGUF

Image-Text-to-Text • Updated Mar 2

mustafaulas/brs

8B • Updated Mar 2 • 6

Ellbendls/Omnisec1-1.2b-Thinking-GGUF

1B • Updated Mar 2 • 5

fweyh/spinoza-qwen3-8b-gguf

8B • Updated Mar 2 • 21

nvlan/ndc_finetune

8B • Updated Mar 8 • 28

ibitato/c64-ministral-3-14b-thinking-c64-reasoning-gguf

Text Generation • 14B • Updated Mar 2 • 45

prithivMLmods/Qwen3.5-2B-f32-GGUF

Image-Text-to-Text • 2B • Updated Mar 3 • 328 • 1

prithivMLmods/Qwen3.5-0.8B-f32-GGUF

Image-Text-to-Text • 0.8B • Updated Mar 3 • 445 • 1

NrengifoBTS/redactoria-v3-gold

8B • Updated Mar 6 • 9

prithivMLmods/Qwen3.5-4B-f32-GGUF

Image-Text-to-Text • 4B • Updated Mar 3 • 345 • 1

prithivMLmods/Qwen3.5-9B-f32-GGUF

Image-Text-to-Text • 9B • Updated Mar 3 • 909 • 2

mathensley/lfm2.5-it-ft-gguf

1B • Updated Mar 2 • 24

AlexStepanenko/llama-3.2-B-Instruct-bnb-4bit-demo

3B • Updated Mar 2 • 25

haidar038/tte-ai-assistant-gguf

8B • Updated Mar 2 • 19

AleRo96/Qwen3_14B_v2

15B • Updated Mar 2 • 12

thewillofdee/my-coder-model

8B • Updated Mar 3 • 17

rjohal164/coachmode-llama32-1b-vbmi-q4km

1B • Updated Mar 3 • 10

albertlieadrian/qwen3-0.6b-codeforces-sft-gguf

0.8B • Updated Mar 3 • 9

MeridianVector/gemma3-4b-vision-gspo-v6-GGUF

4B • Updated Mar 3 • 129