Inference Providers
Active filters: nvfp4
Shifusen/Strawberrylemonade-L3-70B-v1.2-NVFP4
Text Generation
• 41B • Updated • 1
cybermotaz/qwen3-vl-2b-thinking-nvfp4-w4a16
Image-Text-to-Text
• 2B • Updated • 8
• 1
cybermotaz/qwen3-vl-4b-thinking-nvfp4-w4a16
Image-Text-to-Text
• 3B • Updated • 481
• 1
cybermotaz/qwen3-vl-8b-thinking-nvfp4-w4a16
Image-Text-to-Text
• 5B • Updated • 88
• 2
Shifusen/72B-Qwen2.5-Kunou-v1-NVFP4
Text Generation
• 42B • Updated • 5
Shifusen/L3.3-The-Omega-Directive-70B-Unslop-v2.1-NVFP4
Text Generation
• 41B • Updated • 8
mratsim/Hearthfire-24B-NVFP4
Text Generation
• 14B • Updated • 2
mratsim/Hearthfire-24B-NVFP4A16
Text Generation
• 14B • Updated • 3
• 1
ealexeev/TheDrummer-Snowpiercer-15B-v4-NVFP4
Text Generation
• 9B • Updated • 5
Shifusen/Forgotten-Safeword-70B-v5.0-NVFP4
Text Generation
• 41B • Updated • 1
• 1
Shifusen/Draconic-Tease-70B-NVFP4
41B • Updated ealexeev/TheDrummer-Cydonia-24B-v4.3-NVFP4
Text Generation
• 14B • Updated • 153
cybermotaz/Qwen3-VL-32B-Instruct-NVFP4
Image-Text-to-Text
• 18B • Updated • 35.8k
Shifusen/dolphin-2.9.1-llama-3-70b-NVFP4-vllm
41B • Updated • 7
Shifusen/Negative_LLAMA_70B-NVFP4
Text Generation
• 41B • Updated • 14
• 1
cybermotaz/Qwen3-Omni-30B-A3B-Instruct-NVFP4
Text Generation
• Updated • 7
Shifusen/L3.3-70B-PippaMaid-1.0-NVFP4
Text Generation
• 41B • Updated • 4
GadflyII/Qwen3-VL-235B-A22B-Instruct-NVFP4
Image-Text-to-Text
• 133B • Updated • 10
GadflyII/Qwen3-VL-235B-A22B-Thinking-NVFP4
Image-Text-to-Text
• 133B • Updated • 121
nvidia/DeepSeek-V3.2-NVFP4
Text Generation
• 394B • Updated • 32.9k
• 15
nvidia/Qwen3-235B-A22B-Thinking-2507-NVFP4
Text Generation
• 120B • Updated • 22.8k
• 8
nvidia/Qwen3-235B-A22B-Instruct-2507-NVFP4
Text Generation
• 120B • Updated • 2.09k
• 9
Firworks/Llama-3.3-8B-Instruct-nvfp4
5B • Updated • 216
Firworks/Cydonia-24B-v4.3-heretic-nvfp4
14B • Updated • 77
Firworks/Llama-3.3-70B-Joyous-nvfp4
41B • Updated • 3
Shifusen/Llama-3.3-70B-Instruct-NVFP4
Firworks/Solar-Open-100B-nvfp4
58B • Updated • 10
• 6
Firworks/IQuest-Coder-V1-40B-Instruct-nvfp4
23B • Updated • 1
• 2
GadflyII/MiniMax-M2.1-NVFP4
Text Generation
• 129B • Updated • 15
• 7
Firworks/Qwen2.5-3B-Instruct-Reticent-nvfp4