Inference Providers
Active filters: fp8
15B • Updated • 1
fraseque/llama-3.2-1B-FP8-Neuron
Text Generation
• 1B • Updated • 2
6chan/krea-realtime-video-fp8
Text-to-Video
• Updated • 24
• 6
bash99/Qwen3-30B-A3B-Instruct-2507-FP8-Dynamic
Text Generation
• Updated • 5
RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-block
Text Generation
• 109B • Updated • 113
• 3
nm-testing/Llama-4-Maverick-17B-128E-Instruct-block-FP8
Text Generation
• Updated • 22
meituan-longcat/LongCat-Flash-Omni-FP8
Text Generation
• Updated • 5
• 15
16B • Updated • 9
efficient-deep-research/gap_0.3_beta_0.5_lora_ckpt_56_merged_FP8
80B • Updated Text Generation
• 685B • Updated Text Generation
• 685B • Updated • 18
Text Generation
• 685B • Updated • 21
aiqwen/DeepSeek-V3.1-Terminus
Text Generation
• 685B • Updated • 21
Text Generation
• 685B • Updated • 21
aiqwen/DeepSeek-V3.2-Exp-Base
Text Generation
• 685B • Updated • 22
aiqwen/DeepSeek-Prover-V2-671B
Text Generation
• 685B • Updated • 37
RedHatAI/granite-4.0-h-small-FP8-dynamic
Text Generation
• 32B • Updated • 228
• 2
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-FP8-block
Text Generation
• 402B • Updated • 14
• 1
wangkanai/sdxl-fp8-loras-nsfw
Text-to-Image
• Updated • 1
wangkanai/wan21-fp8-loras
Text-to-Video
• Updated • 1
tencent/DeepSeek-V3.1-Terminus-W4AFP8
Text Generation
• 349B • Updated • 297
• 16
Text Generation
• 229B • Updated • 11
• 17
Text Generation
• 229B • Updated • 10
nm-testing/granite-4.0-h-small-FP8-block
Text Generation
• 32B • Updated • 8
newmindai/Llama-3.1-8B-Instruct-w16a16-tw
Text Generation
• 8B • Updated • 6
flymyd/fine-lightning-14b-FP8-Dynamic
Text Generation
• 15B • Updated • 1
newmindai/Llama-3.1-8B-Instruct-w16a8-1node-bs8
Text Generation
• 8B • Updated • 7
mistralai/Ministral-3-8B-Instruct-2512
9B • Updated • 223k
• 162
DragonLLM/Llama-Open-Finance-8B-FP8
Question Answering
• 8B • Updated • 27
• 2