-
-
-
-
-
-
Inference Providers
Active filters: fp8
groxaxo/DeepSWE-Preview-FP8
33B • Updated
• 2
• 1
thorejaya/Affine-5DWmAmd1BRYNy8adMx7EV8oUhVuiAcq8tFiYuBVcF2xp3WPa
groxaxo/Qwen3-32B-AWorld-W8A16
9B • Updated
• 3
• 2
groxaxo/gpt-oss-20b-ShiningValiant3-W8A16
Text Generation
• 20B • Updated
• 13
• 2
vschandramourya/mistral-3.2-instruct-2506-quantized
Updated
poorbag/Affine-5DWmAmd1BRYNy8adMx7EV8oUhVuiAcq8tFiYuBVcF2xp3WPa
Updated
RedHatAI/Qwen3-Coder-480B-A35B-Instruct-FP8
Text Generation
• 480B • Updated
• 76
• 3
unsloth/Kimi-K2-Instruct-0905
Text Generation
• Updated
• 51
• 7
nvidia/Phi-4-multimodal-instruct-FP8
6B • Updated
• 15.1k
• 4
nvidia/Phi-4-reasoning-plus-FP8
15B • Updated
• 454
• 3
Text Generation
• Updated
• 4
baseten-admin/kimi-0905-fp4
581B • Updated
• 815
EliovpAI/Deepseek-R1-0528-Qwen3-8B-FP8-KV
Text Generation
• 8B • Updated
• 2
remodlai/Llama-3.2-1B-Instruct-Nova-FP8
remodlai/Qwen3-4B-Thinking-2507-Nova-fp8
Isotr0py/DeepSeek-V3-0324-tiny
Text Generation
• Updated
• 4
DevQuasar/moonshotai.Kimi-K2-Instruct-0905-BF16
Text Generation
• 1T • Updated
• 17
cmsptcp/Llama-PLLuM-8B-instruct-FP8-Dynamic
phazei/HunyuanVideo-Foley
Text Generation
• 8B • Updated
• 5.54k
• 3
nvidia/Qwen2.5-VL-7B-Instruct-FP8
Text Generation
• 8B • Updated
• 338
• 7
jobs-git/Kimi-K2-Instruct-GGUF
Text Generation
• 1T • Updated
• 232
TheClusterDev/Qwen3-Next-80B-A3B-Instruct-FP8
Text Generation
• 80B • Updated
• 155
• 1
datasysdev/qwen3-30b-a3b-pruned
Text Generation
• 31B • Updated
• 1
datasysdev/qwen3-30b-a3b-new-pruned-arch
22B • Updated
• 1
Text Generation
• 1T • Updated
• 7
jobs-git/Kimi-K2-Instruct-0905
Text Generation
• 1T • Updated
• 13
TheClusterDev/Qwen3-Next-80B-A3B-Instruct-FP8-Dynamic
Text Generation
• 80B • Updated
• 1.42k
• 4
chutesai/DeepSeek-V3.1-NextN
12B • Updated
• 1
dasLOL/Affine-12412414412124123
Text Generation
• 1T • Updated
• 34