Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

930

Base only

Active filters: nvfp4

nvidia/Gemma-4-26B-A4B-NVFP4

Text Generation • 14B • Updated May 11 • 2M • 97

0xSero/GLM-5.2-504B

Text Generation • 290B • Updated about 4 hours ago • 5.23k • 14

nvidia/MiniMax-M3-NVFP4

Text Generation • 247B • Updated about 2 hours ago • 185 • 14

nvidia/DeepSeek-V4-Flash-NVFP4

Text Generation • 167B • Updated 11 days ago • 202k • 43

madeby561/GLM-5.2-NVFP4-REAP-504B-term

Text Generation • 290B • Updated 3 days ago • 1.09k • 13

michaelw9999/Qwen3.6-27B-NVFP4-MTP-GGUF

27B • Updated 19 days ago • 41.9k • 29

sakamakismile/gemma-4-12B-coder-fable5-composer2.5-MTP-NVFP4

Text Generation • 7B • Updated 10 days ago • 3.97k • 45

nvidia/diffusiongemma-26B-A4B-it-NVFP4

Text Generation • 14B • Updated 15 days ago • 861k • 84

madeby561/GLM-5.2-NVFP4-REAP-504B

Text Generation • 290B • Updated 4 days ago • 328 • 8

rdtand/Qwen3.6-27B-PrismaAURA-5.5bit-vllm

20B • Updated about 20 hours ago • 49 • 7

AEON-7/Qwen3.6-27B-AEON-Ultimate-Uncensored-Multimodal-NVFP4-MTP-XS

Text Generation • 17B • Updated 1 day ago • 43.5k • 51

poolside/Laguna-M.1-NVFP4

Text Generation • 131B • Updated 5 days ago • 2.71k • 10

DJLougen/Qwable-5-27B-Coder-NVFP4

Text Generation • 15B • Updated 2 days ago • 383 • 5

AEON-7/Qwen3.6-27B-AEON-Ultimate-Uncensored-NVFP4

Text Generation • 19B • Updated about 18 hours ago • 49.8k • 78

sakamakismile/Huihui-Qwen3.6-27B-abliterated-NVFP4-MTP

Text Generation • 17B • Updated 25 days ago • 62.6k • 63

rdtand/Qwen3.6-27B-PrismaSCOUT-Blackwell-NVFP4-BF16-vllm

17B • Updated May 4 • 83.6k • 31

srv-sngh/gemma-4-12B-coder-fable5-composer2.5-nvfp4

Text Generation • 2B • Updated about 5 hours ago • 4.41k • 9

SummonGovernance/Qwable-27B-NVFP4-MTP-GGUF

Text Generation • 27B • Updated 9 days ago • 1.33k • 4

brandonmusic/GLM-5.2-NVFP4-REAP-Recall-N172

Text Generation • 296B • Updated 1 day ago • 562 • 4

DreamFast/gemma-3-12b-it-heretic-v2

Text Generation • 12B • Updated Mar 10 • 7.81k • • 49

RedHatAI/Qwen3.6-35B-A3B-NVFP4

20B • Updated 25 minutes ago • 2.62M • 157

saricles/MiniMax-M2.7-REAP-172B-A10B-NVFP4-GB10

Text Generation • 87B • Updated Apr 19 • 6.82k • 28

sakamakismile/Qwen3.6-27B-Text-NVFP4-MTP

Text Generation • 17B • Updated Apr 29 • 439k • 77

AEON-7/Qwen3.6-27B-AEON-Ultimate-Uncensored-Multimodal-NVFP4-MTP

Text Generation • 20B • Updated 1 day ago • 37k • 21

llmfan46/Qwen3.6-27B-uncensored-heretic-v2-Native-MTP-Preserved-NVFP4-GGUF

Image-Text-to-Text • 27B • Updated May 7 • 18.4k • 29

AxionML/Gemma-4-12B-NVFP4

Image-Text-to-Text • 8B • Updated 22 days ago • 46.6k • 6

Winnougan/LTX-2.3-INT8

Updated 3 days ago • 11

DreamFast/qwen3-8b-heretic

Text Generation • 8B • Updated Mar 20 • 3.85k • • 15

RecViking/Mistral-Medium-3.5-128B-NVFP4

74B • Updated May 9 • 108k • 9

Brian6145/Qwen3.6-27B-Claude-Opus-Sonnet-Distilled-NVFP4-MTP

Text Generation • 20B • Updated May 8 • 18.8k • 37