Inference Providers
Active filters: fp8
vlnkane/DeepSeek-V3-4Layer
15B • Updated unsloth/Qwen3-VL-2B-Instruct-FP8
Image-Text-to-Text
• 2B • Updated • 298
KJML/typhoon2.5-qwen3-30b-a3b-FP8-Dynamic
Text Generation
• 31B • Updated • 84
unsloth/Qwen3-VL-2B-Thinking-FP8
Image-Text-to-Text
• 2B • Updated • 7
• 1
unsloth/Qwen3-VL-32B-Instruct-FP8
Image-Text-to-Text
• 33B • Updated • 976
• 1
unsloth/Qwen3-VL-32B-Thinking-FP8
Image-Text-to-Text
• 33B • Updated • 8
• 2
unsloth/Qwen3-VL-30B-A3B-Thinking-FP8
Image-Text-to-Text
• 31B • Updated • 13
unsloth/Qwen3-VL-30B-A3B-Instruct-FP8
Image-Text-to-Text
• 31B • Updated • 68
unsloth/Qwen3-VL-235B-A22B-Thinking-FP8
Image-Text-to-Text
• 236B • Updated • 15
unsloth/Qwen3-VL-235B-A22B-Instruct-FP8
Image-Text-to-Text
• 236B • Updated • 7
haydn-jones/Intern-S1-Qwen3-FP8
241B • Updated • 2
TevunahAi/Apertus-8B-Instruct-2509-FP8
Text Generation
• 8B • Updated • 14
FlagRelease/MiniMax-M2-FlagOS
229B • Updated • 11
• 1
nm-testing/Llama-3.1-8B-Instruct-QKV-Cache-FP8-Per-Tensor
Updated
nm-testing/Llama-3.1-8B-Instruct-QKV-Cache-FP8-Per-Head
Updated
nm-testing/Llama-3.1-8B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Tensor
Updated
nm-testing/Llama-3.1-8B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Head
Updated
nm-testing/Qwen3-32B-QKV-Cache-FP8-Per-Tensor
Updated
nm-testing/Qwen3-32B-QKV-Cache-FP8-Per-Head
Updated
nm-testing/Qwen3-32B-FP8-dynamic-QKV-Cache-FP8-Per-Tensor
Updated
nm-testing/Qwen3-32B-FP8-dynamic-QKV-Cache-FP8-Per-Head
Updated
nm-testing/Llama-3.3-70B-Instruct-QKV-Cache-FP8-Per-Tensor
Updated
nm-testing/Llama-3.3-70B-Instruct-QKV-Cache-FP8-Per-Head
Updated
nm-testing/Llama-3.3-70B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Tensor
Updated
nm-testing/Llama-3.3-70B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Head
Updated
TevunahAi/Apertus-70B-Instruct-2509-2048-Calibration-FP8
Text Generation
• 71B • Updated • 3
TevunahAi/gpt-oss-20b-2048-Calibration-FP8
Text Generation
• 21B • Updated • 23
• 1
TevunahAi/gpt-oss-120b-1024-Calibration-FP8
Text Generation
• 117B • Updated • 151
• 2
Text-to-Image
• Updated • 9.39k
• 45