Active filters: fp8
Text Generation
• 688B • Updated • 1
unsloth/Devstral-2-123B-Instruct-2512
125B • Updated • 259
• 3
mlx-community/Devstral-2-123B-Instruct-2512-bf16
Text Generation
• 125B • Updated • 165
RedHatAI/Qwen3-VL-32B-Instruct-FP8-dynamic
Text Generation
• 33B • Updated • 166
• 1
RedHatAI/Qwen3-VL-32B-Instruct-FP8-block
Text Generation
• 33B • Updated • 11
RedHatAI/Qwen3-VL-32B-Instruct-NVFP4
Text Generation
• 20B • Updated • 887
• 6
Text Generation
• 9B • Updated • 12
31B • Updated
alexgusevski/Ministral-3-3B-Instruct-2512-mlx
Text Generation
• 3B • Updated • 9
alexgusevski/Ministral-3-8B-Instruct-2512-mlx
Text Generation
• 8B • Updated • 10
RedHatAI/Qwen3-Next-80B-A3B-Instruct-NVFP4
Text Generation
• Updated • 219
• 5
RedHatAI/Qwen3-Next-80B-A3B-Instruct-FP8-dynamic
Text Generation
• 80B • Updated • 21
• 1
RedHatAI/Qwen3-Next-80B-A3B-Instruct-FP8-block
Text Generation
• 80B • Updated • 8
aaroncaozj/BAGEL-7B-MoT_FP8
Any-to-Any
• 15B • Updated • 2
Text Generation
• 27B • Updated • 1
tsqn/Z-Image-Turbo_fp8_comfyui
Text-to-Image
• Updated • 376
• 4
jgerster0/Apertus-8B-Instruct-2509-LOGEQ-FP8_dynamic
8B • Updated
RedHatAI/GLM-4.6-FP8-dynamic
Text Generation
• 353B • Updated • 37
mlx-community/mistralai_Devstral-Small-2-24B-Instruct-2512-MLX-BF16
Text Generation
• 24B • Updated • 135
codemichaeld/wan_1.3b_v4-fp8
Text Generation
• 33B • Updated • 32
• 1
snuh/hari-q2.5-thinking-fp8
73B • Updated • 11
• 1
XiaomiMiMo/MiMo-V2-Flash-Base
Text Generation
• 310B • Updated • 322
• 48
73B • Updated • 2
• 3
lovedheart/Qwen3-Next-80B-A3B-Instruct-fastllm-fp8-int4g128
Text Generation
• 81B • Updated • 6
ig1/BioMistral-7B-FP8-Dynamic
Text Generation
• 7B • Updated • 5
ig1/medgemma-27b-it-FP8-Dynamic
Text Generation
• 29B • Updated • 701
ig1/medgemma-27b-text-it-FP8-Dynamic
Text Generation
• 28B • Updated • 2.51k
8B • Updated • 300
688B • Updated • 1
• 1