RedHatAI/Apertus-70B-Instruct-2509-quantized.w4a16 Text Generation • 11B • Updated Sep 23, 2025 • 21 • 1
RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-dynamic Text Generation • 236B • Updated Oct 3, 2025 • 46 • 4
RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-block Text Generation • 236B • Updated Oct 27, 2025 • 15 • 3
RedHatAI/NVIDIA-Nemotron-Nano-9B-v2-FP8-dynamic Text Generation • 9B • Updated Oct 14, 2025 • 1.44k • 2
nm-testing/Llama-4-Scout-17B-16E-Instruct-BLOCK-FP8 Text Generation • 109B • Updated Oct 27, 2025 • 5
RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-block Text Generation • 109B • Updated Oct 27, 2025 • 35 • 3
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-FP8-block Text Generation • 402B • Updated Oct 27, 2025 • 9 • 1