Active filters: int8
RedHatAI/DeepSeek-R1-Distill-Qwen-32B-quantized.w8a8
Text Generation
• Updated
• 121
• 13
RedHatAI/DeepSeek-R1-Distill-Qwen-7B-quantized.w8a8
Text Generation
• 8B • Updated
• 6.51k
• 5
RedHatAI/DeepSeek-R1-Distill-Qwen-1.5B-quantized.w8a8
Text Generation
• 2B • Updated
• 7.88k
• 2
RedHatAI/Pixtral-Large-Instruct-2411-hf-quantized.w8a8
Image-Text-to-Text
• 124B • Updated
• 1
ospatch/QwQ-32B-INT8-W8A8
Text Generation
• 33B • Updated
• 4
• 5
labaispeak/stable-diffusion-2-1-openvino-int8
Text-to-Image
• Updated
• 1
ConfidentialMind/gte-multilingual-reranker-base-onnx-op14-opt-gpu-int8
Sentence Similarity
• Updated
• 293
• 1
QuantTrio/Qwen3-235B-A22B-GPTQ-Int8
Text Generation
• 235B • Updated
• 152
Gapeleon/bytedance_BAGEL-7B-MoT-INT8
Any-to-Any
• Updated
• 1
• 24
sfrontull/transloco-ita-lld
Translation
• Updated
mr-abhisharma/AceNemotron-14B-Quantize-8bit
Text Generation
• 15B • Updated
DESUCLUB/Llama-3.1-8B-Instruct-quantized.w8a8
Text Generation
• Updated
• 9
DESUCLUB/Llama-3.1-8B-Instruct-bf16-quantized.w8a8
Text Generation
• Updated
CarlOwOs/Qwen3-0.6B-Base-int8
Text Generation
• 0.8B • Updated
• 1
DESUCLUB/Qwen3-14B-v0.2-deepresearch-no-think-100-step-bf16-quantized.w8a8
Text Generation
• Updated
• 2
Text Generation
• Updated
• 1
• 1
vlad-m-dev/mobilenetv2_doc_photo_quant
Image Classification
• Updated
• 1
vlad-m-dev/mobilenet_v3_small_onnx_photo_doc
Image Classification
• Updated
• 2
janni-t/qwen3-embedding-0.6b-int8-tei-onnx
Sentence Similarity
• Updated
• 42
• 2
raul-delarosa99/bert-base-multilingual-cased-ner-es-onnx-static-int8
Token Classification
• Updated
• 161
vlad-m-dev/distiluse-base-multilingual-v2-merged-onnx
Feature Extraction
• Updated
• 1
onnx-community/distiluse-base-multilingual-v2-merged-onnx
Feature Extraction
• Updated
• 1
Parveshiiii/mistral-small-int8
Text Generation
• 7B • Updated
• 2
• 1
Chris7v7/nllb-200-3.3B-int8
Translation
• Updated
• 3
Kernicterus/whisper-large-v3-turbo-ct2-int8
AINovice2005/Voxtral-Mini-3B-2507-smashed
Audio-Text-to-Text
• Updated
AINovice2005/medgemma-4b-it-smashed
Image-Text-to-Text
• 4B • Updated
• 1
groxaxo/Qwen3-8B-abliterated-GPTQ-W8A16
• 3B • Updated
• 1
• 1
groxaxo/OpenCodeReasoning-Nemotron-1.1-32B-GPTQ-W8A16
Text Generation
• Updated
• 1
RedHatAI/gemma-3n-E4B-it-quantized.w8a8
Image-Text-to-Text
• 8B • Updated
• 1