Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

11,598

Base only

Active filters: quantized

ReallyFloppyPenguin/MiniCPM4-8B-GGUF

8B • Updated Jun 14, 2025 • 25

ReallyFloppyPenguin/Nemotron-Research-Reasoning-Qwen-1.5B-GGUF

2B • Updated Jun 14, 2025 • 39 • 1

Renugadevi82/cisco-nx-ai-4bit

1B • Updated Jun 16, 2025 • 1

LibraxisAI/QwQ-32B-MLX-Q5

Text Generation • 33B • Updated May 11 • 46

ReallyFloppyPenguin/OpenCodeReasoning-Nemotron-14B-GGUF

15B • Updated Jun 16, 2025 • 16 • 1

QuantStack/Wan2.1_T2V_14B_LightX2V_StepCfgDistill_VACE-GGUF

Image-to-Video • 17B • Updated Jun 17, 2025 • 600 • 25

ReallyFloppyPenguin/Jan-nano-GGUF

4B • Updated Jun 16, 2025 • 29

ReallyFloppyPenguin/Qwen2.5-Math-7B-GGUF

Updated Jun 16, 2025

ReallyFloppyPenguin/Qwen3-0.6B-GGUF

0.8B • Updated Jun 16, 2025 • 17

ReallyFloppyPenguin/Holo1-7B-GGUF

8B • Updated Jun 16, 2025 • 12

janni-t/qwen3-embedding-0.6b-int8-tei-onnx

Sentence Similarity • Updated Jun 17, 2025 • 242 • 2

LibraxisAI/Qwen3-14b-MLX-Q5

Text Generation • 15B • Updated May 11 • 153

steampunque/Qwen3-4B-MP-GGUF

4B • Updated Feb 18 • 19

ReallyFloppyPenguin/DeepSeek-R1-Distill-Qwen-32B-GGUF

33B • Updated Jul 5, 2025 • 1

ReallyFloppyPenguin/Gemma-3-Gaia-PT-BR-4b-it-GGUF

4B • Updated Jun 17, 2025 • 38

steampunque/Qwen3-32B-MP-GGUF

33B • Updated Apr 23 • 125 • 1

yukihamada/buzzquan-sensei-q8

Text Generation • Updated Jun 18, 2025 • 4

yukihamada/buzzquan-student-q8

Text Generation • Updated Jun 18, 2025 • 4

yukihamada/buzzquan-sensei-trained

4B • Updated Jun 18, 2025 • 1

ReallyFloppyPenguin/Qwen3-30B-A3B-GGUF

31B • Updated Jun 18, 2025 • 14

ReallyFloppyPenguin/II-Medical-8B-1706-GGUF

8B • Updated Jun 20, 2025 • 4

vlad-m-dev/distiluse-base-multilingual-v2-merged-onnx

Feature Extraction • Updated Oct 29, 2025 • 1

mehta/CooperLM-354M-4bit

Text Generation • 0.4B • Updated Jun 21, 2025 • 7 • 1

steampunque/Mistral-Small-3.2-24B-Instruct-2506-MP-GGUF

24B • Updated Feb 18 • 23 • 2

ReallyFloppyPenguin/Polaris-4B-Preview-GGUF

4B • Updated Jun 23, 2025 • 9

ReallyFloppyPenguin/Arch-Agent-7B-GGUF

8B • Updated Jun 23, 2025 • 26

ReallyFloppyPenguin/Nanonets-OCR-s-GGUF

3B • Updated Jun 23, 2025 • 26

kanrishaurus/llama3-8b-sahabatai-v1-instruct-GGUF

Text Generation • 8B • Updated Jun 23, 2025 • 9

steampunque/Qwen2.5-VL-7B-Instruct-MP-GGUF

8B • Updated Feb 18 • 160

TheMelonGod/Jan-nano-exl2

Text Generation • Updated Jun 30, 2025 • 4