RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-FP8-dynamic Text Generation • 561B • Updated about 5 hours ago • 13.6k • 1
RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-quantized.w4a16 Text Generation • 565B • Updated 2 days ago • 1.62k • 3
RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-FP8-block Text Generation • 561B • Updated 2 days ago • 1.41k
RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-quantized.w4a16 Text Generation • 565B • Updated 2 days ago • 1.62k • 3
RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-FP8-block Text Generation • 561B • Updated 2 days ago • 1.41k
RedHatAI/NVIDIA-Nemotron-3-Ultra-550B-A55B-FP8-dynamic Text Generation • 561B • Updated about 5 hours ago • 13.6k • 1
inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-hybrid Image-Text-to-Text • 28B • Updated May 20 • 103
inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-hybrid Image-Text-to-Text • 28B • Updated May 20 • 103
inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-hybrid Image-Text-to-Text • 28B • Updated May 20 • 103