Inference Providers
Active filters: fp8
TheClusterDev/Cydonia-v4.1-MS3.2-Magnum-Diamond-24B-FP8
24B • Updated • 2
meituan-longcat/LongCat-Flash-Chat-FP8
Text Generation
• 562B • Updated • 100
• 24
chaitnya26/DeepSeek-V3.1-Base
Text Generation
• 685B • Updated • 6
TheClusterDev/GLM-Steam-106B-A12B-v1-FP8
107B • Updated • 36
DevQuasar-2/deepseek-ai.DeepSeek-V3.1-BF16
Text Generation
• 684B • Updated • 28
Keozon/GLM-4.5-Air-fp8_e4m3-quark-gfx1100
107B • Updated • 2
• 1
groxaxo/DeepSWE-Preview-FP8
33B • Updated • 9
• 1
thorejaya/Affine-5DWmAmd1BRYNy8adMx7EV8oUhVuiAcq8tFiYuBVcF2xp3WPa
groxaxo/Qwen3-32B-AWorld-W8A16
9B • Updated • 4
• 2
moonshotai/Kimi-K2-Instruct-0905
Text Generation
• 1T • Updated • 326k
• • 698
groxaxo/gpt-oss-20b-ShiningValiant3-W8A16
Text Generation
• 20B • Updated • 8
• 2
vschandramourya/mistral-3.2-instruct-2506-quantized
Updated
poorbag/Affine-5DWmAmd1BRYNy8adMx7EV8oUhVuiAcq8tFiYuBVcF2xp3WPa
RedHatAI/Qwen3-Coder-480B-A35B-Instruct-FP8
Text Generation
• 480B • Updated • 50
• 3
unsloth/Kimi-K2-Instruct-0905
Text Generation
• Updated • 15
• 7
nvidia/Phi-4-multimodal-instruct-FP8
6B • Updated • 1.1k
• 6
nvidia/Phi-4-reasoning-plus-FP8
15B • Updated • 660
• 5
Text Generation
• Updated • 2
baseten-admin/kimi-0905-fp4
581B • Updated • 70
EliovpAI/Deepseek-R1-0528-Qwen3-8B-FP8-KV
Text Generation
• 8B • Updated • 6
remodlai/Llama-3.2-1B-Instruct-Nova-FP8
remodlai/Qwen3-4B-Thinking-2507-Nova-fp8
Isotr0py/DeepSeek-V3-0324-tiny
Text Generation
• Updated • 6
DevQuasar/moonshotai.Kimi-K2-Instruct-0905-BF16
Text Generation
• 1T • Updated • 8
cmsptcp/Llama-PLLuM-8B-instruct-FP8-Dynamic
8B • Updated • 3
phazei/HunyuanVideo-Foley
Text Generation
• 8B • Updated • 28.6k
• 4
Text Generation
• 15B • Updated • 5.98k
• 4
nvidia/Qwen2.5-VL-7B-Instruct-FP8
Text Generation
• 8B • Updated • 559
• 7
jobs-git/Kimi-K2-Instruct-GGUF
Text Generation
• 1T • Updated • 107