RedHatAI/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4 • Text Generation • 67B • Updated 10 days ago • 3.08k • 2
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w4a16 • Text Generation • 8B • Updated 17 days ago • 75.3k • 30
RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic • Text Generation • 8B • Updated 17 days ago • 54.3k • 9
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a8 • Text Generation • 8B • Updated 17 days ago • 28.4k • 20
RedHatAI/Mistral-Small-24B-Instruct-2501-quantized.w4a16 • Text Generation • 24B • Updated 17 days ago • 437 • 1
RedHatAI/granite-3.1-8b-instruct-quantized.w4a16 • Text Generation • 8B • Updated 17 days ago • 1.02k • 1
RedHatAI/Llama-3.3-70B-Instruct-quantized.w4a16 • Text Generation • 71B • Updated 17 days ago • 1.47k • 3
RedHatAI/Llama-3.3-70B-Instruct-quantized.w8a8 • Text Generation • 71B • Updated 17 days ago • 2.72k • 14
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic • Text Generation • 71B • Updated 17 days ago • 234 • 15