-
-
-
-
-
-
Inference Providers
Active filters: fp8
cerebras/Qwen3-Coder-REAP-363B-A35B-FP8
Text Generation
• Updated
• 41
• 15
cerebras/Qwen3-Coder-REAP-246B-A35B-FP8
Text Generation
• 246B • Updated
• 678
• 21
wangkanai/wan22-fp8-i2v-loras
Text-to-Video
• Updated
• 1
Image-to-Video
• Updated
• 1
unsloth/Qwen3-VL-8B-Thinking-FP8
Image-Text-to-Text
• 9B • Updated
• 18
• 2
unsloth/Qwen3-VL-4B-Thinking-FP8
Image-Text-to-Text
• 5B • Updated
• 85
unsloth/Qwen3-VL-4B-Instruct-FP8
Image-Text-to-Text
• 5B • Updated
• 197
• 1
unsloth/Qwen3-VL-8B-Instruct-FP8
Image-Text-to-Text
• 9B • Updated
• 353
• 7
philkuz/llama-3.3-70b-instruct-fp8
Text Generation
• 71B • Updated
• 274
• 1
theostos/LLM4Docq-annotator-fp8
33B • Updated
theostos/babel-translate-fp8
33B • Updated
theostos/babel-ssreflect-fp8
33B • Updated
WenxinChen66/DeepSeek-R1-0528-Channel-INT8
Text Generation
• 685B • Updated
• 4
• 2
FlagRelease/DeepSeek-V3.2-Exp-FlagOS
685B • Updated
• 55
fraseque/Llama-3.3-70B-FP8-Instruct-Neuron
Text Generation
• 71B • Updated
• 2
Qwen/Qwen3-VL-32B-Thinking-FP8
Image-Text-to-Text
• 33B • Updated
• 38k
• 25
selimaktas/Qwen3-14B-FP8-MinMax
15B • Updated
• 1
nm-testing/Llama-4-Scout-17B-16E-Instruct-BLOCK-FP8
Text Generation
• 109B • Updated
• 1
RedHatAI/Llama-3.3-70B-Instruct-FP8-block
Text Generation
• 71B • Updated
• 797
15B • Updated
fraseque/llama-3.2-1B-FP8-Neuron
Text Generation
• 1B • Updated
• 134
6chan/krea-realtime-video-fp8
Text-to-Video
• Updated
• 131
• 6
bash99/Qwen3-30B-A3B-Instruct-2507-FP8-Dynamic
Text Generation
• Updated
• 9
RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-block
Text Generation
• 109B • Updated
• 29
• 3
nm-testing/Llama-4-Maverick-17B-128E-Instruct-block-FP8
Text Generation
• Updated
• 2
meituan-longcat/LongCat-Flash-Omni-FP8
Text Generation
• Updated
• 3
• 15
16B • Updated
• 2
efficient-deep-research/gap_0.3_beta_0.5_lora_ckpt_56_merged_FP8
80B • Updated
Text Generation
• 685B • Updated
Text Generation
• 685B • Updated
• 5