nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 • Text Generation • 124B • Updated 11 days ago • 370k • 244
The Ultra-Scale Playbook 🌌 • Running • 3.83k • The ultimate guide to training LLM on large GPU Clusters