Inference Providers
Active filters: fp4
RedHatAI/Llama-3.1-70B-Instruct-NVFP4A16
Text Generation
• 41B • Updated • 9
RedHatAI/Qwen3-32B-NVFP4A16
Text Generation
• 19B • Updated • 804
• 2
nvidia/Qwen3-235B-A22B-NVFP4
Text Generation
• 133B • Updated • 9.46k
• 15
nvidia/Qwen3-30B-A3B-NVFP4
Text Generation
• 16B • Updated • 266k
• 28
RedHatAI/Llama-4-Scout-17B-16E-Instruct-NVFP4
Text Generation
• 64B • Updated • 1.4k
• 1
apolloparty/Qwen3-4B-NVFP4A16
2B • Updated • 1
Tonic/petite-elle-L-aime-3-sft
Text Generation
• 3B • Updated • 19
• 1
mradermacher/petite-elle-L-aime-3-sft-GGUF
Text Generation
• 3B • Updated • 454
• 1
nm-testing/DeepSeek-R1-Distill-Qwen-32B-NVFP4
Text Generation
• 19B • Updated • 881
• 2
Text Generation
• 2B • Updated • 9
2imi9/Qwen3-1.7B-NVFP4A16
Text Generation
• 1B • Updated • 6
• 1
ELVISIO/Qwen3-8B-NVFP4A16
Text Generation
• 5B • Updated • 6
RedHatAI/Llama-3.3-70B-Instruct-NVFP4
Text Generation
• 41B • Updated • 712
• 1
imgailab/flux1-trtx-dev-fp4-blackwell
Updated • 7
• 1
imgailab/flux1-trtx-schnell-fp4-blackwell
Updated • 5
• 1
llmat/Mistral-7B-Instruct-v0.3-NVFP4
Text Generation
• 4B • Updated • 16
llmat/Mistral-Small-Instruct-2409-NVFP4
Text Generation
• 13B • Updated • 8
2imi9/gpt-oss-20B-NVFP4A16-BF16
Text Generation
• 21B • Updated • 615
• 4
nvidia/Phi-4-multimodal-instruct-NVFP4
4B • Updated • 1.59k
• 10
nvidia/Phi-4-reasoning-plus-NVFP4
8B • Updated • 1.35k
• 8
nvidia/Llama-3.1-8B-Instruct-NVFP4
5B • Updated • 112k
• 8
Text Generation
• 5B • Updated • 29.6k
• 16
Text Generation
• 8B • Updated • 147k
• 7
Text Generation
• 17B • Updated • 91.1k
• 14
nvidia/Qwen2.5-VL-7B-Instruct-NVFP4
Text Generation
• 5B • Updated • 5.46k
• 14
xxrjun/DeepSeek-R1-0528-FP4
394B • Updated • 5
Sunbird/Sunflower-14B-4bit-fp4-bnb
Text Generation
• 15B • Updated • 2
Sunbird/Sunflower-32B-4bit-fp4-bnb
Text Generation
• 33B • Updated • 20
RedHatAI/Qwen3-VL-235B-A22B-Instruct-NVFP4
Text Generation
• 133B • Updated • 3.47k
• 15
RedHatAI/Llama-3.1-8B-Instruct-NVFP4
Text Generation
• 5B • Updated • 19.3k
• 1