Models: 5,338

Active filters: llama.cpp

morningwould/llama3-bespin-gguf

8B • Updated Mar 14 • 2

NamrataThakur/llama31-8bn_Reinforcement-Fine-Tuned

Question Answering • 8B • Updated Mar 14 • 9

wangzhang/Qwen3.5-122B-A10B-abliterated-GGUF

Text Generation • 122B • Updated Mar 30 • 41 • 5

bagusm/gemma-3-12b-it-heretic

lszczuro/Bielik-1.5B-v3.0-Instruct-polish-riddles-gguf

2B • Updated Mar 14 • 639

machinadeusex/Qwen3.5-Thinking-AIO-GGUF

Image-Text-to-Text • 0.8B • Updated Mar 14 • 121

robcorey/model

8B • Updated Mar 15 • 2

Daga2001/Llama-70B-God-Tier-GGUF

Text Generation • 71B • Updated Mar 15 • 17

chai1208/Qwen3.5-35B-A3B-patent-Q8_0-GGUF

35B • Updated Mar 15 • 282

chhatramani/nyayalm-6legaltaskrag_qwen3-4B-Q4_K_M-GGUF

4B • Updated Mar 15 • 2

NicolasSaba/Clone_Nicolas_V1

8B • Updated Mar 15 • 26

apache256/qwen_finetune

0.8B • Updated Mar 15 • 25

RamRamki/medical-phi3

4B • Updated Mar 15 • 1

mrrobots18/gguf-poc-alignment-nullderef

32 • Updated Mar 15 • 2

JIULANG/unsloth-Qwen3.5-4B-Instruct-CitationMarker-GGUG

4B • Updated Mar 15 • 497

SALEETAI/Medical-Llama-3-GGUF

4B • Updated Mar 15 • 11

Pruthvirahul/astramind-gguf

Text Generation • 8B • Updated Mar 15 • 8

qingyi26/GGUF_clip_memory_corruption

1 • Updated Mar 15

qingyi26/GGUF_semantic_output_manipulation

chhatramani/nyayalm-civilcode_qwen3-4B_2e_GGUF

4B • Updated Mar 15 • 16

ia-espirita/riv-ai

Text Generation • 8B • Updated May 4 • 207 • 4

ishmaifan/patent-sum-llama3.2-3b-gguf

3B • Updated Mar 16 • 36

drissea-ai/drissy-qwen3.5-2b-GGUF

Image-Text-to-Text • 2B • Updated Mar 16 • 58 • 2

rkumar70900/qwen2.5-1.5b-gguf-experiments

Text Generation • 2B • Updated Mar 15 • 155

asazot/gemma_ft_q4_k_m

0.3B • Updated Mar 16 • 1

N-Bot-Int/SmolSam3-GGUF

3B • Updated Mar 16 • 14

kostakoff/Mixtral-8x7B-Instruct-v0.1-GGUF

Text Generation • 47B • Updated Mar 16 • 30

adamwhite625/gemma-2-2b-text2sql-12k-gguf

3B • Updated Mar 16 • 14

Liberol/OnlyGeminiFinalModel

8B • Updated Mar 16 • 6

amps93/Qwen3.5-9B-FT-NER-KR-V2-GGUF

9B • Updated Mar 16 • 48