Models: 741

Active filters: modelopt

Cirrascale/Kimi-K2.5-NVFP4

Text Generation • Updated Feb 12 • 10

Cirrascale/Qwen3-Coder-480B-A35B-Instruct-NVFP4

Text Generation • 241B • Updated Feb 12 • 7

switzerchees/LightOnOCR-2-1B-NVFP4

Image-Text-to-Text • 0.8B • Updated Feb 14 • 24

lukealonso/MiniMax-M2.5-NVFP4

130B • Updated Apr 12 • 9.93k • 45

txn545/Qwen3-Coder-Next-NVFP4

Updated Feb 14 • 8 • 1

mconcat/Trinity-Large-TrueBase-NVFP4

202B • Updated Feb 20 • 305 • 1

mconcat/MiniMax-M2.5-NVFP4

130B • Updated Feb 24 • 7

lukealonso/GLM-5-NVFP4

425B • Updated Feb 17 • 28 • 9

alphakek/GLM-4.7-Flash-heretic-NVFP4

Text Generation • 17B • Updated Feb 17 • 10 • 3

Seitaro-Yabuta/MN-Violet-Lotus-12B-NVFP4

Text Generation • 7B • Updated Feb 16 • 5

mgoin/Qwen3-0.6B-MXFP8

0.6B • Updated Feb 16 • 370

vincentzed-hf/Qwen3.5-397B-A17B-NVFP4

Image-Text-to-Text • Updated Feb 17 • 11 • 11

Simplismart/Llama-3.1-8B-Lexi-Uncensored-V2-NVFP4

5B • Updated Feb 19 • 6

TusharGoel/Lexi-Llama-Uncensored-NVFP4

5B • Updated Feb 19 • 3

jaival-nvidia/Step-3.5-Flash-NVFP4

111B • Updated Feb 23 • 2

Banana-Bae/Qwen3-235B-A22B-Instruct-2507-REAP-nvfp4

Text Generation • 90B • Updated Feb 21 • 11 • 1

surogate/Qwen3-0.6B-NVFP4

0.4B • Updated Feb 23 • 4

Cirrascale/Qwen3.5-397B-A17B-NVFP4

Text Generation • Updated Feb 23 • 8

huxiang088/OLMoE-1B-7B-0924-Instruct-NVFP4

4B • Updated Feb 25 • 7

mconcat/Trinity-Large-Base-NVFP4

202B • Updated Feb 24 • 6

sbull-dell/Qwen3-Coder-Next-NVFP4

Updated Feb 24 • 4

txn545/Qwen3.5-35B-A3B-NVFP4

Text Generation • Updated Mar 1 • 20.3k • 6

txn545/Qwen3.5-27B-NVFP4

Text Generation • 17B • Updated Mar 1 • 166 • 1

osoleve/Qwen3.5-27B-Text-NVFP4-MTP

Text Generation • 17B • Updated Mar 5 • 1.51k • 19

mmangkad/Qwen3.5-27B-NVFP4

Text Generation • 20B • Updated Apr 15 • 17 • 1

surogate/Qwen3.5-27B-NVFP4

Text Generation • 17B • Updated Feb 28 • 26

pirola/GLM-4.7-Flash-REAP-23B-A3B-NVFP4

13B • Updated Feb 28 • 5 • 1

ApacheOne/LFM2.5-1.2B-Z-Image-Engineer-V4-nvfp4

0.7B • Updated Feb 28 • 5

0x8badbeef/Aperture-Think-v1-nvfp4

4B • Updated Feb 28 • 3

trohrbaugh/Qwen3.5-122B-A10B-heretic-nvfp4

Image-Text-to-Text • 62B • Updated Apr 14 • 375