RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-dynamic • Image-Text-to-Text • 109B • Updated Sep 22, 2025 • 39.9k • 28
RedHatAI/Llama-4-Scout-17B-16E-Instruct-quantized.w4a16 • Image-Text-to-Text • 20B • Updated Sep 22, 2025 • 28.7k • 12
RedHatAI/Llama-4-Maverick-17B-128E-Instruct • Image-Text-to-Text • 402B • Updated Sep 22, 2025 • 19 • 3
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-FP8 • Image-Text-to-Text • 402B • Updated Sep 22, 2025 • 26.8k • 2
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-FP8-dynamic • Image-Text-to-Text • 24B • Updated Oct 29, 2025 • 2.42k • 9
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w8a8 • Image-Text-to-Text • 24B • Updated Oct 29, 2025 • 221 • 5
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w4a16 • Image-Text-to-Text • 5B • Updated Oct 29, 2025 • 2.09k • 10
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503 • Image-Text-to-Text • 24B • Updated Sep 22, 2025 • 38 • 1
RedHatAI/Mistral-Small-24B-Instruct-2501-FP8-dynamic • Text Generation • 24B • Updated Oct 29, 2025 • 24.1k • 13
RedHatAI/Mistral-Small-24B-Instruct-2501-quantized.w8a8 • Text Generation • 24B • Updated Oct 29, 2025 • 19.7k • 1
RedHatAI/Llama-3.3-70B-Instruct-FP8-dynamic • Text Generation • 71B • Updated Dec 12, 2025 • 30.6k • 15
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic • Text Generation • 71B • Updated Oct 23, 2025 • 1.43k • 15
RedHatAI/Llama-3.3-70B-Instruct-quantized.w8a8 • Text Generation • 71B • Updated Sep 22, 2025 • 3.93k • 13
RedHatAI/Llama-3.3-70B-Instruct-quantized.w4a16 • Text Generation • 11B • Updated Sep 22, 2025 • 2.12k • 3
RedHatAI/granite-3.1-8b-instruct-quantized.w8a8 • Text Generation • 8B • Updated Sep 25, 2025 • 142 • 2
RedHatAI/granite-3.1-8b-instruct-quantized.w4a16 • Text Generation • 1B • Updated Sep 22, 2025 • 1.08k • 1
RedHatAI/Mistral-Small-24B-Instruct-2501-quantized.w4a16 • Text Generation • 4B • Updated Oct 29, 2025 • 397 • 1
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a8 • Text Generation • 8B • Updated Sep 22, 2025 • 9.25k • 20
RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic • Text Generation • 8B • Updated 15 days ago • 26.2k • 9
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w4a16 • Text Generation • 8B • Updated Feb 13 • 34.3k • 30