Active filters: fp8
ajinkya-tejankar/Mistral-7B-Instruct-v0.2-FP8-UltraChat-2000-KV
7B • Updated
Infermatic/Lumimaid-v0.2-70B-FP8-Dynamic
71B • Updated
predibase/Qwen2.5-32B-Instruct-FP8
33B • Updated
• 68
Infermatic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-Dynamic
Text Generation
• 71B • Updated
• 14
predibase/Mistral-7B-Instruct-v0.2-FP8-UltraChat-2000-KV
7B • Updated
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic
Text Generation
• 71B • Updated
• 14.9k
• 14
132B • Updated
Infermatic/Stellar-Odyssey-12b-v0.0-FP8-Dynamic
12B • Updated
Infermatic/Chronos-Platinum-72B-FP8-Dynamic
73B • Updated
• 5
Infermatic/Nautilus-70B-v0.1-FP8-Dynamic
71B • Updated
• 1
yejingfu/nmagic-Meta-Llama-3.1-8B-Instruct-FP8
Text Generation
• 8B • Updated
• 2.81k
mysticbeing/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-DYNAMIC
Text Generation
• 71B • Updated
• 5
• 3
amd/Mistral-7B-v0.1-FP8-KV
7B • Updated
• 2
• 1
yejingfu/nmagic-Meta-Llama-3.1-70B-Instruct-FP8
Text Generation
• 71B • Updated
tencent-community/Hunyuan-A52B-Instruct-FP8
Text Generation
• 389B • Updated
• 3
Dev0502/Qwen2.5-14B-Instruct-abliterated-v2-FP8
15B • Updated
• 5
andecy64/Nxcode-CQ-7B-orpo-FP8
7B • Updated
SicariusSicariiStuff/DeepSeek-Coder-V2-Instruct-FP8
236B • Updated
• 4
EmbeddedLLM/Qwen2.5-72B-Instruct-OCP-FP8-Quark
73B • Updated
• 1
yejingfu/nmagic-Meta-Llama-3-70B-Instruct-FP8
71B • Updated
EmbeddedLLM/Nexusflow_Athena-V2-Chat-OCP-FP8-Quark
73B • Updated
• 1
EmbeddedLLM/Nexusflow_Athena-V2-Agent-OCP-FP8-Quark
73B • Updated
liuxl12/Qwen2.5-32B-Instruct-FP8
33B • Updated
Model-SafeTensors/Meta-Llama-3-8B-Instruct-FP8
8B • Updated
• 3
John6666/jib-mix-flux-v5itsalive-fp8-flux
Text-to-Image
• Updated
• 1
John6666/iniverse-mix-sfwnsfw-f1drealnsfwguofengv2-fp8-flux
Text-to-Image
• Updated
• 1
• 1
John6666/acorn-is-spinning-flux-aisfluxdedistilled-fp8-flux
Text-to-Image
• Updated
• 3
John6666/uncensored-females-flux-fluxdevufv7fp16-fp8-flux
Text-to-Image
• Updated
• 14
• 10
John6666/2758-flux-asian-utopian-v30fp8noclip-fp8-flux
Text-to-Image
• Updated
• 9
RedHatAI/Qwen2.5-7B-Instruct-FP8-dynamic
Text Generation
• 8B • Updated
• 2.72k
• 1