Inference Providers
Active filters: int4
ModelCloud/EXAONE-3.0-7.8B-Instruct-gptq-4bit
8B • Updated • 2
• 3
RedHatAI/Meta-Llama-3.1-405B-Instruct-quantized.w4a16
Text Generation
• 409B • Updated • 453
• 12
angeloc1/llama3dot1FoodDel4v05
Text Generation
• 8B • Updated • 6
zzzmahesh/Meta-Llama-3-8B-Instruct-quantized.w4a4
Text Generation
• 8B • Updated • 5
• 1
ModelCloud/GRIN-MoE-gptq-4bit
42B • Updated • 7
• 6
joshmiller656/Llama3.2-1B-AWQ-INT4
1B • Updated • 5
Advantech-EIOT/intel_llama-3.1-8b-instruct
RedHatAI/Qwen2.5-7B-quantized.w4a16
Text Generation
• 8B • Updated • 59
joshmiller656/Llama-3.1-Nemotron-70B-Instruct-AWQ-INT4
Text Generation
• 71B • Updated • 6
• 3
ModelCloud/Llama-3.2-1B-Instruct-gptqmodel-4bit-vortex-v1
Text Generation
• 1B • Updated • 165
• 2
jojo1899/llama-3_1-8b-instruct-ov-int4
ModelCloud/Llama-3.2-1B-Instruct-gptqmodel-4bit-vortex-v2
Text Generation
• 1B • Updated • 6
• 3
ModelCloud/Llama-3.2-3B-Instruct-gptqmodel-4bit-vortex-v3
Text Generation
• 4B • Updated • 26
• 5
tclf90/qwen2.5-72b-instruct-gptq-int4
Text Generation
• 73B • Updated • 61
• 2
ModelCloud/Llama-3.2-1B-Instruct-gptqmodel-4bit-vortex-v2.5
Text Generation
• 1B • Updated • 125
• 5
jojo1899/Phi-3.5-mini-instruct-ov-int4
ModelCloud/Qwen2.5-Coder-32B-Instruct-gptqmodel-4bit-vortex-v1
Text Generation
• 33B • Updated • 63
• 16
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-FP8-dynamic
Text Generation
• 8B • Updated • 10
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-quantized.w4a16
Text Generation
• 2B • Updated • 5
ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v1
Text Generation
• 33B • Updated • 12
• 51
ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v2
Text Generation
• 33B • Updated • 10
• 16
ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v3
Text Generation
• 33B • Updated • 12
• 14
ModelCloud/Falcon3-10B-Instruct-gptqmodel-4bit-vortex-v1
Text Generation
• 10B • Updated • 10
• 3
RedHatAI/Llama-3.3-70B-Instruct-quantized.w4a16
Text Generation
• 71B • Updated • 9.84k
• 3
RedHatAI/Mixtral-8x22B-v0.1-quantized.w4a16
141B • Updated • 29
RedHatAI/Mixtral-8x7B-v0.1-quantized.w4a16
47B • Updated • 129
RedHatAI/QwQ-32B-Preview-quantized.w4a16
33B • Updated • 12
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w4a16
Text Generation
• 71B • Updated • 12
nintwentydo/pixtral-12b-2409-W4A16-G128
Image-Text-to-Text
• 13B • Updated • 76
• 2
RedHatAI/granite-3.1-8b-instruct-quantized.w4a16
Text Generation
• 8B • Updated • 1.51k
• 1