Active filters: fp8
newmindai/Llama-3.1-8B-Instruct-w16a16-4nodes-bs32
Text Generation
• 8B • Updated • 6
newmindai/Llama-3.1-8B-Instruct-w16a8-4nodes-bs32
Text Generation
• 8B • Updated • 6
newmindai/Llama-3.1-8B-Instruct-w16a16-8nodes-bs32
Text Generation
• 8B • Updated • 2
Text Generation
• 33B • Updated • 6
cerebras/MiniMax-M2-REAP-139B-A10B
Text Generation
• Updated • 23
• 18
yanolja/YanoljaNEXT-Rosetta-12B-2510-FP8
12B • Updated • 22
jane-street/dormant-model-3
671B • Updated • 163
• 5
nex-agi/DeepSeek-V3.1-Nex-N1
Text Generation
• 671B • Updated • 431
• 43
Text Generation
• 685B • Updated • 30
Kwai-Keye/Keye-VL-671B-A37B
Video-Text-to-Text
• Updated • 8
• 19
ai-sage/GigaChat3-10B-A1.8B
Text Generation
• 11B • Updated • 7.73k
• 66
ai-sage/GigaChat3-702B-A36B-preview
Text Generation
• Updated • 216
• 87
starbix/Apertus-8B-Instruct-2509-FP8_dynamic
Text Generation
• 8B • Updated • 1
unsloth/Qwen3-4B-Instruct-2507-FP8
Text Generation
• Updated • 1.72k
• 4
unsloth/Qwen3-4B-Thinking-2507-FP8
Text Generation
• Updated • 1.35k
• 2
Nulia/WeirdCompound-v1.7-24b-FP8-Dynamic
Text Generation
• 24B • Updated • 141
geonmin-kim/Qwen3-MoE-1.2B-A0.6B-FP8
1B • Updated • 51
newmindai/Llama-3.1-8B-Instruct-w16a8-4nodes-bs64
Text Generation
• 8B • Updated • 3
newmindai/Llama-3.1-8B-Instruct-w16a8-8nodes-bs64
Text Generation
• 8B • Updated • 5
TevunahAi/NextCoder-7B-FP8
Text Generation
• 8B • Updated • 4
TevunahAi/NextCoder-14B-FP8
Text Generation
• 15B • Updated • 4
Float16-cloud/typhoon2.5-qwen3-4b-fp8
Text Generation
• 4B • Updated • 12
TevunahAi/NextCoder-32B-FP8
Text Generation
• 33B • Updated • 10
Float16-cloud/typhoon-ocr-3b-fp8
Image-Text-to-Text
• 4B • Updated • 1
TevunahAi/granite-8b-code-instruct-4k-FP8
Text Generation
• 8B • Updated • 6
TevunahAi/granite-20b-code-instruct-8k-FP8
Text Generation
• 20B • Updated • 8
TevunahAi/granite-34b-code-instruct-8k-FP8
Text Generation
• 34B • Updated • 8
TevunahAi/Qwen3-Next-80B-A3B-Instruct-FP8
Text Generation
• Updated • 4
vlnkane/DeepSeek-V3-4Layer
15B • Updated
unsloth/Qwen3-VL-2B-Instruct-FP8
Image-Text-to-Text
• 2B • Updated • 298