Running on CPU Upgrade 164 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 164 Explore synthetic data experiments in a bookshelf view
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano and Super v3. • 26 items • Updated 2 days ago • 85
Nemotron-Pre-Training-Datasets Collection Large scale pre-training datasets used in the Nemotron family of models. • 12 items • Updated 2 days ago • 115
RedSage Models Collection Continued Pretraining and Post-trained RedSage Models. • 5 items • Updated Feb 9