Active filters: nvfp4
llmat/Qwen3-30B-A3B-NVFP4
Text Generation
• 17B • Updated • 1
Text Generation
• 19B • Updated
Text Generation
• 9B • Updated • 5
Text Generation
• 5B • Updated • 3
Text Generation
• 3B • Updated • 114
Text Generation
• 1B • Updated • 317
Text Generation
• 0.6B • Updated • 189
2imi9/gpt-oss-20B-NVFP4A16-BF16
Text Generation
• 21B • Updated • 888
• 5
llmat/Apertus-8B-Instruct-2509-NVFP4
Text Generation
• 5B • Updated • 4
• 1
mratsim/Seed-OSS-36B-Instruct-NVFP4
Text Generation
• 21B • Updated • 37
• 4
mratsim/Wayfarer-Large-70B-NVFP4
Text Generation
• 41B • Updated • 2
• 1
Text Generation
• 41B • Updated
mratsim/Anubis-70B-v1.1-NVFP4
Text Generation
• 41B • Updated • 2
• 1
mratsim/L3.3-Ignition-v0.1-70B-NVFP4
Text Generation
• 41B • Updated
mratsim/GoldDiamondGold-L33-70B-NVFP4
Text Generation
• 41B • Updated • 15
mratsim/Strawberrylemonade-L3-70B-v1.1-NVFP4
Text Generation
• 41B • Updated • 8
mratsim/Wayfarer-Large-70B-NVFP4A16
Text Generation
• 41B • Updated • 4
mratsim/Nova-70B-NVFP4A16
Text Generation
• 41B • Updated
mratsim/Anubis-70B-v1.1-NVFP4A16
Text Generation
• 41B • Updated • 5
mratsim/GoldDiamondGold-L33-70B-NVFP4A16
Text Generation
• 41B • Updated • 3
mratsim/L3.3-Ignition-v0.1-70B-NVFP4A16
Text Generation
• 41B • Updated • 1
mratsim/Strawberrylemonade-L3-70B-v1.1-NVFP4A16
Text Generation
• 41B • Updated
guerilla7/Foundation-Sec-8B-Instruct-NVFP4-quantized
5B • Updated
Ex0bit/Qwen3-VLTO-32B-Instruct-NVFP4
Text Generation
• 17B • Updated • 27
• 1
Ex0bit/Qwen3-VLTO-32B-Instruct-NVFP4-256K
Text Generation
• 17B • Updated • 59
• 1
kaitchup/Qwen3-VL-2B-Instruct-W4A16
0.9B • Updated • 15
• 2
kaitchup/Qwen3-VL-8B-Instruct-W4A16
2B • Updated • 55
kaitchup/Qwen3-VL-8B-Instruct-NVFP4
2B • Updated • 185
prithivMLmods/Nanonets-OCR2-3B-AWQ-nvfp4
Image-Text-to-Text
• 3B • Updated • 36
mratsim/Codex-24B-Small-3.2-NVFP4
Text Generation
• 14B • Updated • 1