nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 Text Generation • 124B • Updated about 2 hours ago • 6.44k • 163
Nemotron-Pre-Training-Datasets Collection Large scale pre-training datasets used in the Nemotron family of models. • 12 items • Updated 2 days ago • 115
Running on CPU Upgrade 167 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 167 Explore synthetic data experiments in a bookshelf view