Inference Providers
Active filters: fp8
k-l-lambda/DeepSeek-V3.1-Terminus-FP4
397B • Updated • 3
• 1
xxrjun/DeepSeek-R1-0528-FP4
394B • Updated • 4
Glazkov/qwen2.5-vl-table-extraction-FP8-Dynamic
Image-to-Text
• 4B • Updated RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-dynamic
Text Generation
• 236B • Updated • 91
• 4
QuantTrio/Qwen3-VL-235B-A22B-Instruct-FP8
Text Generation
• Updated • 86
QuantTrio/Qwen3-VL-235B-A22B-Thinking-FP8
Text Generation
• 236B • Updated • 74
Text Generation
• 685B • Updated • 11
RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-block
Text Generation
• 236B • Updated • 34
• 3
deepseek-ai/DeepSeek-V3.2-Exp-Base
Text Generation
• 685B • Updated • 1.1k
• 66
JoshPP/DeepSeek-V3-16layers
153B • Updated • 62
1T • Updated RedHatAI/NVIDIA-Nemotron-Nano-9B-v2-FP8-dynamic
Text Generation
• 9B • Updated • 3.63k
• 3
Qwen/Qwen3-VL-235B-A22B-Instruct-FP8
Image-Text-to-Text
• 236B • Updated • 53.9k
• 43
Qwen/Qwen3-VL-235B-A22B-Thinking-FP8
Image-Text-to-Text
• 236B • Updated • 2.81k
• 28
Qwen/Qwen3-VL-30B-A3B-Instruct-FP8
Image-Text-to-Text
• Updated • 268k
• 105
Qwen/Qwen3-VL-30B-A3B-Thinking-FP8
Image-Text-to-Text
• 31B • Updated • 3.85k
• 53
kavanmevada/eng-word-model-ds32
Text Generation
• 2B • Updated • 1
GaleneAI/Magistral-Small-2509-FP8-Dynamic
Updated • 12
• 2
RedHatAI/Llama-3.1-8B-Instruct-FP8-block
Text Generation
• 8B • Updated • 49
Text-to-Image
• Updated • 2
yejingfu/prune-deepseek-v3.1-e32
99B • Updated Qwen/Qwen3-VL-4B-Thinking-FP8
Image-Text-to-Text
• 5B • Updated • 1.33k
• 30
Qwen/Qwen3-VL-8B-Thinking-FP8
Image-Text-to-Text
• 9B • Updated • 4.9k
• 32
Qwen/Qwen3-VL-8B-Instruct-FP8
Image-Text-to-Text
• 9B • Updated • 326k
• 67
Text Generation
• Updated • 3
Text Generation
• Updated • 11
nm-testing/Llama-3.1-70B-Instruct-FP8-block
Text Generation
• Updated RedHatAI/Qwen3-14B-FP8-block
Text Generation
• 15B • Updated • 18
nm-testing/Qwen3-30B-A3B-FP8-block
Text Generation
• 3B • Updated • 5
RedHatAI/Qwen3-32B-FP8-block
Text Generation
• 33B • Updated • 19