-
-
-
-
-
-
Inference Providers
Active filters: fp8
NotDivYt/Deepseek_R1_DIVV
Text Generation
• 685B • Updated
4B • Updated
Azure99/Blossom-V6.1-32B-FP8
33B • Updated
• 3
Azure99/Blossom-V6.1-14B-FP8
15B • Updated
Azure99/Blossom-V6.1-8B-FP8
8B • Updated
• 1
Azure99/Blossom-V6.1-GLM-32B-FP8
33B • Updated
• 1
baseten/Kimi-K2-Instruct-FP4
581B • Updated
• 1.27k
• 1
1T • Updated
• 4
Yi30/Kimi-K2-Instruct-G2-0716
1T • Updated
• 2
33B • Updated
• 45
• 10
Text Generation
• Updated
• 1.56k
• 78
unsloth/Qwen3-235B-A22B-Instruct-2507-FP8
Text Generation
• Updated
• 156
• 3
Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
Text Generation
• Updated
• 138k
• • 148
unsloth/Qwen3-Coder-480B-A35B-Instruct-FP8
Text Generation
• 480B • Updated
• 48
• 8
michaelbenayoun/deepseekv3-tiny-4kv-heads-4-layers-random
Text Generation
• 5.27M • Updated
• 2
MollyHexapotato/custom-deepseek-r1-4L
15B • Updated
Image-Text-to-Text
• Updated
• 215
• 40
unsloth/Qwen3-235B-A22B-Thinking-2507-FP8
Text Generation
• Updated
• 17
• 2
werty1248/Midm-2.0-Base-Instruct-FP8
12B • Updated
RedHatAI/SmolLM3-3B-FP8-dynamic
Text Generation
• 3B • Updated
• 392
• 1
unsloth/Qwen3-30B-A3B-Instruct-2507-FP8
Text Generation
• Updated
• 660
• 7
unsloth/Qwen3-30B-A3B-Thinking-2507-FP8
Updated
• 104
• 2
willcb/Qwen3-30B-A3B-Instruct-2507-FP8
Text Generation
• 31B • Updated
• 10
yonghenglh6/vlm_test_model_for_sglang_toy
8B • Updated
• 1
Text Generation
• 31B • Updated
• 2
Image-Text-to-Text
• Updated
• 779
• 20
Text Generation
• 685B • Updated
• 3