Models: 5,392

Active filters: llama.cpp

nawta/Heron-NVILA-Lite-2B-Q8_0-GGUF

Image-Text-to-Text • 2B • Updated Nov 28, 2025 • 5

nawta/Heron-NVILA-Lite-2B-F16-GGUF

Image-Text-to-Text • 2B • Updated Nov 28, 2025 • 10

nawta/Heron-NVILA-Lite-2B-Q2_K-GGUF

Image-Text-to-Text • 2B • Updated Nov 28, 2025 • 9

nawta/Heron-NVILA-Lite-2B-Q3_K_M-GGUF

Image-Text-to-Text • 2B • Updated Nov 28, 2025 • 10

nawta/Heron-NVILA-Lite-2B-Q5_K_M-GGUF

Image-Text-to-Text • 2B • Updated Nov 28, 2025 • 7

nawta/Heron-NVILA-Lite-2B-Q6_K-GGUF

Image-Text-to-Text • 2B • Updated Nov 28, 2025 • 8 • 1

prithivMLmods/Qwen3-VisionCaption-2B

Image-Text-to-Text • 2B • Updated Nov 29, 2025 • 18 • 5

prithivMLmods/Qwen3-VisionCaption-2B-GGUF

Image-Text-to-Text • 2B • Updated Nov 29, 2025 • 1.29k • 9

hellstone1918/test-model

3B • Updated Nov 28, 2025 • 2

lefteris6/aziz-llm-llama-3.2-3B-Instruct-unsloth

3B • Updated Nov 28, 2025 • 5

mradermacher/Qwen3-VisionCaption-2B-GGUF

2B • Updated Nov 29, 2025 • 87 • 1

mradermacher/Qwen3-VisionCaption-2B-i1-GGUF

2B • Updated Dec 4, 2025 • 140 • 1

conff/model

1B • Updated Nov 29, 2025 • 1

Kaleemullah/deepseek-r1-distill-qwen-1.5b-gguf

2B • Updated Nov 29, 2025 • 7

Kaleemullah/deepseek-r1-distill-qwen-7b-gguf

8B • Updated Nov 29, 2025 • 9

Kaleemullah/DeepSeek-R1-Distill-Llama-8B-gguf

8B • Updated Nov 29, 2025 • 8

nmnth/gemma-3-1b-extract-rating-lora-merged-Q8_0-GGUF

1.0B • Updated Nov 29, 2025 • 3

hellstone1918/Llama-3.2-3B-basic-lora-model

3B • Updated Nov 30, 2025 • 4

mburaksayici/golden_generate_qwen_0.6b_v2

Updated Nov 30, 2025

mburaksayici/golden_generate_qwen_0.6b_v2_gguf

0.6B • Updated Nov 30, 2025 • 30

mahdishahsavari/gpt-oss-20B-finetune-gguf

21B • Updated Nov 29, 2025 • 13

Rotan-Mohamad9/Healthcare_Model

8B • Updated Jan 15 • 6

tzu98/mistral-12B-wux-16

Updated Nov 30, 2025 • 10

tzu98/mistral-12B-wux-q4

Updated Nov 30, 2025 • 10

jacqueasd/Mantrika-Gemma3-4B-GGUF

4B • Updated Nov 30, 2025 • 1

jacobbista/llama3-3b-finetome

3B • Updated Nov 30, 2025 • 6

astegaras/merged_kaggle

3B • Updated Nov 30, 2025 • 2

0b10headedcalf/shaderWrap-Qwen2.5CoderGGUF

15B • Updated Mar 25 • 27

0b10headedcalf/tinyllama

1B • Updated Dec 9, 2025 • 2

moxin-org/Qwen3-Next-80B-A3B-Instruct-GGUF

Text Generation • 80B • Updated Dec 1, 2025 • 78 • 3