RedHatAI/Llama-3.3-70B-Instruct-quantized.w4a16 Text Generation • 71B • Updated Sep 22, 2025 • 7.23k • 3
RedHatAI/Llama-3.3-70B-Instruct-quantized.w8a8 Text Generation • 71B • Updated Sep 22, 2025 • 16.4k • 14
RedHatAI/DeepSeek-R1-Distill-Llama-8B-quantized.w8a8 Text Generation • 8B • Updated Feb 27, 2025 • 161 • 2
RedHatAI/DeepSeek-R1-Distill-Llama-8B-quantized.w4a16 Text Generation • 8B • Updated Feb 27, 2025 • 6.19k
RedHatAI/DeepSeek-R1-Distill-Llama-70B-quantized.w8a8 Text Generation • 71B • Updated Feb 27, 2025 • 293 • 2
RedHatAI/DeepSeek-R1-Distill-Qwen-7B-quantized.w4a16 Text Generation • 8B • Updated Feb 27, 2025 • 39 • 2
RedHatAI/DeepSeek-R1-Distill-Qwen-14B-quantized.w8a8 Text Generation • 15B • Updated Feb 27, 2025 • 108 • 2
RedHatAI/DeepSeek-R1-Distill-Qwen-14B-quantized.w4a16 Text Generation • 15B • Updated Feb 27, 2025 • 252 • 1
RedHatAI/DeepSeek-R1-Distill-Qwen-32B-quantized.w4a16 Text Generation • 33B • Updated Feb 27, 2025 • 471 • 5
RedHatAI/DeepSeek-R1-Distill-Qwen-32B-quantized.w8a8 Text Generation • 33B • Updated Feb 27, 2025 • 248 • 13
RedHatAI/DeepSeek-R1-Distill-Qwen-7B-quantized.w8a8 Text Generation • 8B • Updated Feb 27, 2025 • 175 • 5
RedHatAI/DeepSeek-R1-Distill-Qwen-1.5B-quantized.w8a8 Text Generation • 2B • Updated Feb 27, 2025 • 114 • 2
RedHatAI/DeepSeek-R1-Distill-Llama-70B-quantized.w4a16 Text Generation • 71B • Updated Feb 27, 2025 • 2.62k • 6
RedHatAI/DeepSeek-R1-Distill-Qwen-1.5B-quantized.w4a16 Text Generation • 2B • Updated Feb 27, 2025 • 63 • 1
ISTA-DASLab/Mistral-Small-3.1-24B-Instruct-2503-GPTQ-4b-128g Image-Text-to-Text • 24B • Updated Apr 6, 2025 • 6.21k • 17
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-FP8-dynamic Image-Text-to-Text • 24B • Updated Oct 29, 2025 • 3.64k • 9
RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-dynamic Image-Text-to-Text • 109B • Updated Sep 22, 2025 • 13.9k • 29