RedHatAI/Apertus-70B-Instruct-2509-quantized.w4a16 • Text Generation • 11B • Updated Sep 23, 2025 • 18 • 1
RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-dynamic • Text Generation • 236B • Updated Oct 3, 2025 • 51 • 4
RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-block • Text Generation • 236B • Updated Oct 27, 2025 • 12 • 3
RedHatAI/NVIDIA-Nemotron-Nano-9B-v2-FP8-dynamic • Text Generation • 9B • Updated Oct 14, 2025 • 1.49k • 2
nm-testing/Llama-4-Scout-17B-16E-Instruct-BLOCK-FP8 • Text Generation • 109B • Updated Oct 27, 2025 • 4