Models: 17

Active filters: nvidia-modelopt

r0b0tlab/VibeThinker-3B-NVFP4

Text Generation • 2B • Updated 15 days ago • 566 • 3

cloud19/sainemo-12b-fp4-blackwell

12B • Updated Feb 2 • 2

1kxia/Qwen3-Embedding-0.6B-modelopt-fp8

Feature Extraction • 0.6B • Updated Feb 11 • 249

1kxia/gemma-3-270m-modelopt-fp8

Feature Extraction • 0.3B • Updated Feb 11 • 253

Banana-Bae/Qwen3-235B-A22B-Instruct-2507-REAP-nvfp4

Text Generation • 90B • Updated Feb 21 • 14 • 1

vistralis/FLUX.2-klein-4b-INT8-transformer-quants

Text-to-Image • Updated Feb 25 • 20

vistralis/FLUX.2-klein-base-4b-INT8-transformer-quants

Text-to-Image • Updated Feb 25 • 28

kleinpanic93/Qwen3-Coder-30B-A3B-Instruct-NVFP4

Text Generation • 31B • Updated Mar 4 • 31 • 1

2imi9/olmoearth-nano-fp8

Feature Extraction • Updated Mar 25

2imi9/olmoearth-nano-fp4

Feature Extraction • Updated Mar 25

vrfai/Qwen3-ASR-1.7B-fp8

Automatic Speech Recognition • 2B • Updated 26 days ago • 544 • 5

vrfai/Qwen3-ASR-1.7B-nvfp4

Automatic Speech Recognition • 1B • Updated 26 days ago • 155 • 5

vrfai/Qwen3-ASR-1.7B-int8

Automatic Speech Recognition • 2B • Updated 26 days ago • 3

vrfai/Qwen3-ASR-1.7B-int4

Automatic Speech Recognition • 2B • Updated 26 days ago • 3

Shashwat42/Qwen3.6-27B-VLM-NVFP4

Image-Text-to-Text • Updated 24 days ago • 185

jeffpeng3/cohere-transcribe-03-2026-NVFP4

Automatic Speech Recognition • 1B • Updated 11 days ago • 37

r0b0tlab/Agents-A1-NVFP4

Text Generation • 19B • Updated about 5 hours ago