Inference Providers
Active filters: modelopt
lukealonso/MiniMax-M2-NVFP4
115B • Updated • 51
• 14
Text Generation
• 7B • Updated • 7
• 1
leatan95/Tongyi-DeepResearch-30B-A3B-NVFP4
16B • Updated • 2
DataSnake/Wayfarer-12B-NVFP4
Text Generation
• 7B • Updated • 5
• 1
DataSnake/Wayfarer-2-12B-NVFP4
Text Generation
• 7B • Updated • 6
• 2
Ex0bit/OLMo-3-7B-Instruct-NVFP4-1M
Text Generation
• 4B • Updated • 10
• 2
wangqia0309/Captain-Eris_Violet-V0.420-12B-FP8-KV-modelopt
12B • Updated • 8
rahtml/Qwen3-Coder-30B-A3B-Instruct-NVFP4
16B • Updated • 2
nvidia/Kimi-K2-Thinking-NVFP4
Text Generation
• Updated • 7.87k
• 30
eousphoros/DeepSeek-V3.2-NVFP4
Text Generation
• 387B • Updated • 16
• 5
zhuyksir/qwen3_30b_a3b_nvfp4_baseline
16B • Updated • 1
zhuyksir/qwen3_30b_a3b_nvfp4_qat
16B • Updated • 1
alphatozeta/sglang_glm_4_6_fp4_modelopt
177B • Updated • 2
ericlewis/Nemotron-Orchestrator-8B-NVFP4
Text Generation
• 5B • Updated • 4
nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4
Text Generation
• Updated • 19.4k
• 39
trithemius/Velvet-14B-nvfp4
8B • Updated • 1
nvidia/Qwen3-Next-80B-A3B-Thinking-NVFP4
Text Generation
• Updated • 3.68k
• 59
OPENZEKA/Qwen3-4B-Instruct-2507-NVFP4
2B • Updated • 152
Z841973620/Qwen3-30B-A3B-NVFP4
Text Generation
• 16B • Updated • 1
Z841973620/Qwen3-30B-A3B-FP8
Text Generation
• 31B • Updated • 2
OPENZEKA/Qwen3-Coder-30B-A3B-Instruct-NVFP4
Text Generation
• 16B • Updated • 9.82k
josephdowling10/Mixtral-8x7B-Instruct-v0.1-NVFP4
Text Generation
• 23B • Updated • 48
taharmasmaliyev07/Llama-2-7b-hf-fp8
7B • Updated • 1
OPENZEKA/Qwen3-Coder-480B-A35B-Instruct-NVFP4
241B • Updated • 9
Shifusen/Llama-3.3-70B-Instruct-abliterated-NVFP4-modelopt
36B • Updated • 5
taharmasmaliyev07/Mistral-7B-v0.1-fp8
7B • Updated • 1
taharmasmaliyev07/Llama-3.1-8B-fp8
8B • Updated • 1
taharmasmaliyev07/gemma-2-9b-it-fp8
9B • Updated • 1
cybermotaz/qwen3-vl-2b-thinking-nvfp4-w4a16
Image-Text-to-Text
• 2B • Updated • 22
• 1
cybermotaz/qwen3-vl-4b-thinking-nvfp4-w4a16
Image-Text-to-Text
• 3B • Updated • 5
• 1