Inference Providers
Active filters: nvfp4
mratsim/Codex-24B-Small-3.2-NVFP4
Text Generation
• 14B • Updated • 1
mratsim/Codex-24B-Small-3.2-NVFP4A16
Text Generation
• 14B • Updated • 8
mratsim/Dans-PersonalityEngine-V1.3.0-24b-NVFP4A16
Text Generation
• 14B • Updated • 5
mratsim/Dans-PersonalityEngine-V1.3.0-24b-NVFP4
Text Generation
• 14B • Updated • 2
mratsim/WeirdCompound-v1.7-24b-NVFP4A16
Text Generation
• 14B • Updated • 1
mratsim/WeirdCompound-v1.7-24b-NVFP4
Text Generation
• 14B • Updated • 15
• 1
mratsim/Behemoth-X-123B-v2-NVFP4
Text Generation
• 69B • Updated • 81
• 4
mratsim/Monstral-123B-v2-NVFP4
Text Generation
• 69B • Updated • 39
• 1
mratsim/Dungeonmaster-V2.2-Expanded-LLaMa-70B-NVFP4
Text Generation
• 41B • Updated • 3
mratsim/Dungeonmaster-V2.2-Expanded-LLaMa-70B-NVFP4A16
Text Generation
• 41B • Updated • 2
mratsim/70B-L3.3-Cirrus-x1-NVFP4A16
Text Generation
• 41B • Updated • 1
• 1
mratsim/70B-L3.3-Cirrus-x1-NVFP4
Text Generation
• 41B • Updated • 15
Text Generation
• 7B • Updated • 2
• 1
DataSnake/Wayfarer-12B-NVFP4
Text Generation
• 7B • Updated • 2
• 1
nvidia/DeepSeek-V3.1-NVFP4
Text Generation
• 394B • Updated • 14.4k
• 16
DataSnake/Wayfarer-2-12B-NVFP4
Text Generation
• 7B • Updated • 2
• 2
Ex0bit/OLMo-3-7B-Instruct-NVFP4-1M
Text Generation
• 4B • Updated • 2
• 2
GaleneAI/Qwen3-VL-235B-A22B-Thinking-NVFP4
Text Generation
• 133B • Updated • 361
• 1
kaitchup/Qwen3-VL-2B-Instruct-NVFP4
2B • Updated • 40
• 1
trithemius/Velvet-14B-nvfp4
8B • Updated • 11
coughmedicine/Huihui-Qwen3-Next-80B-A3B-Instruct-abliterated-nvfp4
Updated • 31
• 1
ealexeev/Mistral-Small-24B-NVFP4
Text Generation
• 14B • Updated • 46
josephdowling10/Mixtral-8x7B-Instruct-v0.1-NVFP4
Text Generation
• 23B • Updated • 30
Shifusen/L3.3-70B-Magnum-v4-SE-NVFP4
Text Generation
• 41B • Updated • 4
Firworks/Snowpiercer-15B-v4-nvfp4
9B • Updated • 4
cybermotaz/nemotron3-nano-nvfp4-w4a16
Text Generation
• 18B • Updated • 2.17k
• 13
ealexeev/The-Drummer-Magidonia-24B-v4.2.0-NVFP4
Text Generation
• 14B • Updated • 3
Shifusen/Strawberrylemonade-L3-70B-v1.2-NVFP4
Text Generation
• 41B • Updated • 1
cybermotaz/qwen3-vl-2b-thinking-nvfp4-w4a16
Image-Text-to-Text
• 2B • Updated • 8
• 1
cybermotaz/qwen3-vl-4b-thinking-nvfp4-w4a16
Image-Text-to-Text
• 3B • Updated • 364
• 1