RedHatAI/Llama-3.3-70B-Instruct-quantized.w4a16 • Text Generation • 11B • Updated Sep 22, 2025 • 1.52k • 3
RedHatAI/Llama-3.3-70B-Instruct-quantized.w8a8 • Text Generation • 71B • Updated Sep 22, 2025 • 2.43k • 13
RedHatAI/DeepSeek-R1-Distill-Llama-8B-quantized.w8a8 • Text Generation • 8B • Updated Feb 27, 2025 • 3.96k • 2
RedHatAI/DeepSeek-R1-Distill-Llama-8B-quantized.w4a16 • Text Generation • 2B • Updated Feb 27, 2025 • 505
RedHatAI/DeepSeek-R1-Distill-Llama-70B-quantized.w8a8 • Text Generation • 71B • Updated Feb 27, 2025 • 243 • 2
RedHatAI/DeepSeek-R1-Distill-Qwen-7B-quantized.w4a16 • Text Generation • 2B • Updated Feb 27, 2025 • 3.54k • 2
RedHatAI/DeepSeek-R1-Distill-Qwen-14B-quantized.w8a8 • Text Generation • 15B • Updated Feb 27, 2025 • 2.14k • 2
RedHatAI/DeepSeek-R1-Distill-Qwen-14B-quantized.w4a16 • Text Generation • 3B • Updated Feb 27, 2025 • 1.54k • 1
RedHatAI/DeepSeek-R1-Distill-Qwen-32B-quantized.w4a16 • Text Generation • 6B • Updated Feb 27, 2025 • 961 • 5
RedHatAI/DeepSeek-R1-Distill-Qwen-32B-quantized.w8a8 • Text Generation • Updated Feb 27, 2025 • 140 • 13
RedHatAI/DeepSeek-R1-Distill-Qwen-7B-quantized.w8a8 • Text Generation • 8B • Updated Feb 27, 2025 • 4.36k • 5
RedHatAI/DeepSeek-R1-Distill-Qwen-1.5B-quantized.w8a8 • Text Generation • 2B • Updated Feb 27, 2025 • 5.96k • 2
RedHatAI/DeepSeek-R1-Distill-Llama-70B-quantized.w4a16 • Text Generation • 11B • Updated Feb 27, 2025 • 9.48k • 5
RedHatAI/DeepSeek-R1-Distill-Qwen-1.5B-quantized.w4a16 • Text Generation • 0.6B • Updated Feb 27, 2025 • 152 • 1
ISTA-DASLab/Mistral-Small-3.1-24B-Instruct-2503-GPTQ-4b-128g • Image-Text-to-Text • 5B • Updated Apr 6, 2025 • 495 • 17
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-FP8-dynamic • Image-Text-to-Text • 24B • Updated Oct 29, 2025 • 165k • 9
RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-dynamic • Image-Text-to-Text • 109B • Updated Sep 22, 2025 • 3.72k • 28
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w4a16 • Image-Text-to-Text • 5B • Updated Oct 29, 2025 • 21.8k • 10
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w8a8 • Image-Text-to-Text • 24B • Updated Oct 29, 2025 • 322 • 5