Models: 31,260

Active filters: 8-bit

nvidia/GLM-5.1-NVFP4

Text Generation • 382B • Updated May 27 • 60.5k • 41

nvidia/DeepSeek-V4-Pro-NVFP4

Text Generation • 910B • Updated 30 days ago • 101k • 72

0xSero/DeepSeek-V4-Flash-180B

Text Generation • 102B • Updated May 30 • 5.82k • 32

google/gemma-4-E4B-it-qat-mobile-transformers

Any-to-Any • 3B • Updated 3 days ago • 6.08k • 23

RedHatAI/diffusiongemma-26B-A4B-it-NVFP4

16B • Updated 6 days ago • 757k • 17

amd/GLM-5.2-MXFP4

412B • Updated 25 days ago • 40.5k • 14

maci0/Qwen3.6-40B-Claude-4.6-Opus-Deckard-Heretic-Uncensored-Thinking-NVFP4

Image-Text-to-Text • 23B • Updated 7 days ago • 3.19k • 9

georgeis55/Ornith-1.0-35B-MLX-oQ8

10B • Updated 6 days ago • 8.63k • 2

sakamakismile/Ornith-1.0-35B-NVFP4

Image-Text-to-Text • 20B • Updated 18 days ago • 239k • 16

fraserprice/DeepSeek-V4-Flash-Abliterated-DSpark

165B • Updated 12 days ago • 2.42k • 9

mlx-community/Ornith-1.0-35B-8bit

Image-Text-to-Text • 10B • Updated 16 days ago • 6.7k • 6

maci0/Qwopus3.6-27B-Coder-NVFP4

Image-Text-to-Text • 16B • Updated 7 days ago • 125k • 2

jarrelscy/GLM-5.2-NVFP4-AQLM-hybrid-500k

Text Generation • 268B • Updated about 16 hours ago • 451 • 2

jarrelscy/GLM-5.2-NVFP4-AQLM-hybrid-250k

Text Generation • 295B • Updated about 16 hours ago • 454 • 3

ailexleon/Gemma-4-Novelist-Eclipse-31B-mlx-8Bit

Text Generation • 32B • Updated 7 days ago • 366 • 2

Youssofal/Qwen3.6-27B-MTPLX-Optimized-Quality-FP16

8B • Updated 6 days ago • 494 • 2

morosystems/ThinkingCap-Qwen3.6-27B-NVFP4

Image-Text-to-Text • 15B • Updated 5 days ago • 2.46k • 2

t-prazak/ThinkingCap-Qwen3.6-27B-MLX-8bit

Image-Text-to-Text • 8B • Updated 6 days ago • 675 • 2

mitomtuna/MiMo-V2.5-0703-NVFP4

179B • Updated about 3 hours ago • 824 • 2

NAMAA-Space/cohere-transcribe-arabic-07-2026-int8

Automatic Speech Recognition • 2B • Updated 3 days ago • 74 • 2

Jiunsong/SuperGLM-5.2-abliterated-NVFP4

Text Generation • 381B • Updated 28 minutes ago • 2

MaziyarPanahi/Mistral-7B-Instruct-v0.3-GGUF

Text Generation • 7B • Updated May 22, 2024 • 114k • 147

HF1BitLLM/Llama3-8B-1.58-100B-tokens

Text Generation • 3B • Updated Sep 19, 2024 • 1.04k • 220

Zoont/faster-whisper-large-v3-turbo-int8-ct2

Automatic Speech Recognition • Updated Jan 31, 2025 • 300 • 15

LoneStriker/DeepSeek-R1-Distill-Llama-70B-8.0bpw-h8-exl2

Text Generation • Updated Feb 2, 2025 • 3 • 2

RedHatAI/Llama-3.1-70B-Instruct-NVFP4

Text Generation • 41B • Updated Nov 21, 2025 • 697 • 1

mlx-community/GLM-4.5-Air-8bit

Text Generation • 107B • Updated Jul 29, 2025 • 4.51k • 10

unsloth/gpt-oss-20b

Text Generation • 22B • Updated Aug 9, 2025 • 29.2k • 49

mshojaei77/gpt-oss-120b

120B • Updated Aug 11, 2025 • 565 • 2

openbmb/MiniCPM-V-4_5-int4

Image-Text-to-Text • 9B • Updated Mar 10 • 4.6k • 16