Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

646

Base only

Active filters: pruning

MilyaShams/Qwen3-1.7B-AutoRound_W4A16_ign

2B • Updated May 17 • 2

MilyaShams/Qwen3-1.7B-SmoothQuant_0.4_AutoRound_W8A8_ign

2B • Updated May 17 • 3

MilyaShams/Qwen3-1.7B-SmoothQuant_0.6_AutoRound_W8A8_ign

2B • Updated May 17 • 2

exdysa/MiniMax-M2.7-REAP-139B-A10B-MLX-4bit

Text Generation • 139B • Updated May 19 • 441

MilyaShams/Qwen3-1.7B-AutoRound_W8A8_ign

2B • Updated May 23 • 3

MilyaShams/Qwen3-1.7B-SmoothQuant_0.5_PTQ_W4A16

2B • Updated May 23 • 3

MilyaShams/Qwen3-1.7B-SmoothQuant_0.6_PTQ_W4A16

2B • Updated May 23 • 3

MilyaShams/Qwen3-1.7B-SmoothQuant_0.5_GPTQ_W4A16

2B • Updated May 23 • 2

MilyaShams/Qwen3-1.7B-SmoothQuant_0.6_GPTQ_W8A8

2B • Updated May 23 • 3

MilyaShams/Qwen3-1.7B-SmoothQuant_0.5_AWQ_W8A8

2B • Updated May 23 • 4

MilyaShams/Qwen3-1.7B-SmoothQuant_0.5_AWQ_W4A16

2B • Updated May 24 • 3

exdysa/MiniMax-M2.7-REAP-139B-A10B-MLX-5bit

Text Generation • 139B • Updated May 25 • 550

enCoder/qwen3-5-4b-mlp7808-distilled

Text Generation • Updated May 26 • 5

PJRM/MiniMax-M2.5-tiny-24e-IQ4_NL-GGUF

Text Generation • 4B • Updated May 28 • 392 • 1

nadizik/Qwen2.5-1.5B-Instruct-Code-En-GGUF-789MB

Text Generation • 1B • Updated May 30 • 112

potto007/gemma-4-19B-A4B-text-REAP-GGUF

Text Generation • 20B • Updated May 30 • 446

poolside-laguna-hackathon/laguna-martini

Text Generation • Updated about 1 month ago

mradermacher/Qwen3-Coder-57B-GGUF

57B • Updated about 1 month ago • 216

mradermacher/Qwen3-Coder-57B-i1-GGUF

57B • Updated about 1 month ago • 253

mradermacher/Qwen3-Coder-64B-GGUF

64B • Updated 29 days ago • 1.02k • 3

mradermacher/Gemma-4-19B-GGUF

18B • Updated 30 days ago • 517 • 1

mradermacher/Gemma-4-21B-GGUF

21B • Updated 30 days ago • 779 • 1

mradermacher/Gemma-4-19B-i1-GGUF

18B • Updated 30 days ago • 3.64k • 1

mradermacher/Gemma-4-21B-i1-GGUF

21B • Updated 29 days ago • 4.3k • 2

mradermacher/Qwen3-Coder-64B-i1-GGUF

64B • Updated 29 days ago • 2.57k • 1

DjeDjeB/Qwen3.6-28B-REAP20-A3B-GGUF

Text Generation • 28B • Updated 29 days ago • 967

SPAISS6F1/qwen-1b-pruned-th

Text Generation • 2B • Updated 24 days ago • 67

SPAISS6F1/gemma-1b-pruned-th

Text Generation • Updated 24 days ago • 108

SrogiLesnik/Gemma-4-19B-mlx-4Bit

Text Generation • 18B • Updated 20 days ago • 70

DJLougen/Qwen3.6-35B-A3B-REAP-90pct

Text Generation • 6B • Updated 16 days ago • 332 • 12