Active filters: modelopt
DataSnake/Wayfarer-12B-NVFP4 • Text Generation • 7B • Updated • 2
DataSnake/Wayfarer-2-12B-NVFP4 • Text Generation • 7B • Updated • 83
Ex0bit/OLMo-3-7B-Instruct-NVFP4-1M • Text Generation • 4B • Updated • 8 • 1
wangqia0309/Captain-Eris_Violet-V0.420-12B-FP8-KV-modelopt • 12B • Updated • 24
rahtml/Qwen3-Coder-30B-A3B-Instruct-NVFP4 • 16B • Updated • 65
eousphoros/DeepSeek-V3.2-NVFP4 • Text Generation • 387B • Updated • 330 • 5
zhuyksir/qwen3_30b_a3b_nvfp4_baseline • 16B • Updated
zhuyksir/qwen3_30b_a3b_nvfp4_qat • 16B • Updated
alphatozeta/sglang_glm_4_6_fp4_modelopt • 177B • Updated • 32
ericlewis/Nemotron-Orchestrator-8B-NVFP4 • Text Generation • 5B • Updated • 19
trithemius/Velvet-14B-nvfp4 • 8B • Updated • 2
OPENZEKA/Qwen3-4B-Instruct-2507-NVFP4 • 2B • Updated • 125
Z841973620/Qwen3-30B-A3B-NVFP4 • Text Generation • 16B • Updated • 9
Z841973620/Qwen3-30B-A3B-FP8 • Text Generation • 31B • Updated
OPENZEKA/Qwen3-Coder-30B-A3B-Instruct-NVFP4 • Text Generation • 16B • Updated • 3.91k
josephdowling10/Mixtral-8x7B-Instruct-v0.1-NVFP4 • Text Generation • 23B • Updated • 14
taharmasmaliyev07/Llama-2-7b-hf-fp8 • 7B • Updated • 1
OPENZEKA/Qwen3-Coder-480B-A35B-Instruct-NVFP4 • 241B • Updated • 43
Shifusen/Llama-3.3-70B-Instruct-abliterated-NVFP4-modelopt • 36B • Updated • 55
taharmasmaliyev07/Mistral-7B-v0.1-fp8 • 7B • Updated
taharmasmaliyev07/Llama-3.1-8B-fp8 • 8B • Updated • 1
taharmasmaliyev07/gemma-2-9b-it-fp8 • 9B • Updated
cybermotaz/qwen3-vl-2b-thinking-nvfp4-w4a16 • Image-Text-to-Text • 2B • Updated • 51 • 1
cybermotaz/qwen3-vl-4b-thinking-nvfp4-w4a16 • Image-Text-to-Text • 3B • Updated
cybermotaz/qwen3-vl-8b-thinking-nvfp4-w4a16 • Image-Text-to-Text • 5B • Updated • 124 • 1
CedricHwang/qwen2.5-0.5b-modelopt-fp8-pc-pt • Text Generation • 0.5B • Updated • 46
CedricHwang/qwen2.5-0.5b-modelopt-fp8-pb-wo • 0.5B • Updated • 55
stepnoy/gpt-oss-120b-NVFP4 • 117B • Updated • 26
baseten-admin/glm-4.7-fp4 • 183B • Updated • 3.23k
ericlewis/functiongemma-270m-it-nvfp4 • 0.2B • Updated