Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

1,821

Base only

Active filters: quantization

vadery/Qwen3.5-0.8B-W8A8

Image-Text-to-Text • 0.9B • Updated May 22 • 7

tturing/Qwen3-Omni-30B-A3B-Thinking-FP8

Any-to-Any • 32B • Updated May 22 • 1.11k • 1

tsolful/Z-Image-L2P-INT8

Text-to-Image • Updated May 26 • 13

FaustusFaber31/gemma-4-E4B-it-awq-int4-xlam

Updated May 23 • 1

FaustusFaber31/Qwen3.5-4B-awq-int4-xlam

YTan2000/Qwopus3.6-27B-v2-TQ3_4S

Image-Text-to-Text • 27B • Updated May 23 • 415 • 13

MilyaShams/Qwen3-1.7B-AutoRound_W8A8_ign

2B • Updated May 23 • 5

prokopsafranek/gemma-4-26B-A4B-it-GGUF

Text Generation • 25B • Updated 22 days ago • 1.13k • 1

MilyaShams/Qwen3-1.7B-SmoothQuant_0.5_PTQ_W4A16

2B • Updated May 23 • 5

MilyaShams/Qwen3-1.7B-SmoothQuant_0.6_PTQ_W4A16

2B • Updated May 23 • 4

MilyaShams/Qwen3-1.7B-SmoothQuant_0.5_GPTQ_W4A16

2B • Updated May 23 • 4

MilyaShams/Qwen3-1.7B-SmoothQuant_0.6_GPTQ_W8A8

2B • Updated May 23 • 4

MilyaShams/Qwen3-1.7B-SmoothQuant_0.5_AWQ_W8A8

2B • Updated May 23 • 6

MilyaShams/Qwen3-1.7B-SmoothQuant_0.5_AWQ_W4A16

2B • Updated May 24 • 4

morphicode-jp/gemma-4-31B-it-L25L26x1.5-IQ1_M

Text Generation • 31B • Updated 15 days ago • 664 • 1

morphicode-jp/gemma-4-31B-it-L25L26x1.5-Q2_K

Text Generation • 31B • Updated 15 days ago • 312 • 2

morphicode-jp/gemma-4-31B-it-L25L26x1.5-Q4_K_M

Text Generation • 31B • Updated 15 days ago • 179 • 1

okasi/gliner2-privacy-filter-pii-multi-onnx

Token Classification • Updated May 24 • 48 • 1

YihongJin/Qwen3-Omni-30B-A3B-Instruct-NVFP4-W4A16-awq

20B • Updated 25 days ago • 42

YihongJin/Qwen3-Omni-30B-A3B-Instruct-NVFP4-W4A16-max

20B • Updated 25 days ago • 25

ShahzebKhoso/Qwen3-8B-GGUF

Text Generation • 8B • Updated May 25 • 46

ankitjakhar/LLaDA-8B-Quantized

Updated 29 days ago

ShahzebKhoso/gemma-3-4b-it

Text Generation • 4B • Updated May 25 • 79

ShahzebKhoso/Qwen2.5-Coder-7B-Instruct

Text Generation • 8B • Updated May 25 • 354

ISTA-DASLab/Qwen3.5-4B-GGUF-GSQ

Text Generation • 4B • Updated about 1 month ago • 175

dangvansam/chandra-ocr-2-NVFP4A16

Image-Text-to-Text • 3B • Updated May 26 • 777

dangvansam/chandra-ocr-2-NVFP4

Image-Text-to-Text • 3B • Updated May 26 • 434

Vineetha00/synapnet-edge

qtj1999/quip-qat-ternary-qwen3-1p7b

dj-oyu/Irodori-TTS-500M-v3-AX8850

Text-to-Speech • Updated about 1 month ago