-
-
-
-
-
-
Inference Providers
Active filters: int8
FriendliAI/Meta-Llama-3.1-70B-Instruct-int8
Text Generation
• 71B • Updated
• 2
RedHatAI/Qwen2.5-7B-Instruct-quantized.w8a8
Text Generation
• 8B • Updated
• 292
• 2
RedHatAI/Qwen2.5-0.5B-quantized.w8a16
Text Generation
• 0.4B • Updated
RedHatAI/Qwen2.5-1.5B-quantized.w8a16
Text Generation
• 0.8B • Updated
• 1
RedHatAI/Qwen2.5-3B-quantized.w8a16
Text Generation
• 1B • Updated
• 1
RedHatAI/Qwen2.5-7B-quantized.w8a16
Text Generation
• 3B • Updated
• 1
• 1
RedHatAI/Qwen2.5-32B-quantized.w8a16
Text Generation
• 9B • Updated
• 2
RedHatAI/Qwen2.5-72B-quantized.w8a16
Text Generation
• 20B • Updated
• 1
avans06/Meta-Llama-3.1-8B-Instruct-ct2-int8_float16
Text Generation
• Updated
• 2
avans06/Meta-Llama-3.2-8B-Instruct-ct2-int8_float16
Text Generation
• Updated
• 20
minpeter/Qwen-Qwen2.5-14B-Instruct-fmo-int8
15B • Updated
minpeter/Qwen-Qwen2.5-32B-Instruct-fmo-int8
33B • Updated
minpeter/anthracite-org-magnum-v4-72b-fmo-int8
73B • Updated
SteveTran/T5-small-query-expansion-INT8
Text Generation
• Updated
• 7
• 2
Text Generation
• 0.1B • Updated
• 303
• 3
mradermacher/ecastera-eva-westlake-7b-spanish-GGUF
7B • Updated
• 86
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w8a8
Text Generation
• 71B • Updated
• 3
RedHatAI/QwQ-32B-Preview-quantized.w8a8
Text Generation
• 33B • Updated
• 1
NeoChen1024/Dolphin3.0-Llama3.1-8B-W8A8
8B • Updated
NeoChen1024/dolphin-2.9.3-mistral-7B-32k-W8A8
7B • Updated
• 3
RedHatAI/granite-3.1-8b-instruct-quantized.w8a8
Text Generation
• 8B • Updated
• 113
• 2
RedHatAI/granite-3.1-2b-instruct-quantized.w8a8
Text Generation
• 3B • Updated
• 7
RedHatAI/granite-3.1-2b-base-quantized.w8a8
Text Generation
• 3B • Updated
• 7
RedHatAI/granite-3.1-8b-base-quantized.w8a8
Text Generation
• 8B • Updated
• 80
NeoChen1024/Ministral-8B-Instruct-2410-W8A8
8B • Updated
• 3
• 2
RedHatAI/Llama-3.3-70B-Instruct-quantized.w8a8
Text Generation
• 71B • Updated
• 1.93k
• 13
RedHatAI/DeepSeek-R1-Distill-Llama-8B-quantized.w8a8
Text Generation
• 8B • Updated
• 4.78k
• 2
RedHatAI/DeepSeek-R1-Distill-Llama-70B-quantized.w8a8
Text Generation
• 71B • Updated
• 298
• 2
RedHatAI/DeepSeek-R1-Distill-Qwen-14B-quantized.w8a8
Text Generation
• 15B • Updated
• 3.18k
• 2
RedHatAI/DeepSeek-R1-Distill-Qwen-32B-quantized.w8a8
Text Generation
• Updated
• 121
• 13