Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -92,7 +92,7 @@ Every model NVIDIA ships rests on a data layer — and that data shapes how the
92
  | If you want to... | Use this Collection | Start with these datasets |
93
  | :---- | :---- | :---- |
94
  | **FOUNDATION** |  |  |
95
- | Pre-train a base model | [**Nemotron Pre-Training Collection**](https://huggingface.co/collections/nvidia/nemotron-pre-training-datasets) | Nemotron-CC-v2.1, Nemotron-CC-Math-v1, Nemotron-CC-Code-v1, Nemotron-ClimbMix |
96
  | **BUILD A CAPABILITY** |  |  |
97
  | Math reasoning, proofs, and quantitative problem-solving | [**Nemotron Math & Reasoning Collection**](https://huggingface.co/collections/nvidia/nemotron-math-and-reasoning) | Nemotron-SFT-Math-v3, Nemotron-Math-v2, AceReason-Math, Nemotron-CC-Math-v1 |
98
  | Code generation, debugging, and SWE workflows | [**Nemotron Code & SWE Collection**](https://huggingface.co/collections/nvidia/nemotron-code-and-swe) | Nemotron-SFT-Competitive-Programming-v2, Nemotron-SFT-SWE-v2, Nemotron-CC-Code-v1 |
@@ -107,4 +107,4 @@ Every model NVIDIA ships rests on a data layer — and that data shapes how the
107
  | Evaluate model performance | **Nemotron Eval & Benchmark Collection** | SPEED-bench |
108
  | **SPECIALIZED & SOVEREIGN** |  |  |
109
  | Multilingual or domain-specific (e.g. finance) capability | [**Nemotron Supervised Fine-Tuning Collection**](https://huggingface.co/collections/nvidia/nemotron-supervised-fine-tuning) | Nemotron-SFT-Multilingual-v1, Nemotron-SpecializedDomains-Finance-v1 |
110
- | Diverse synthetic personas grounded in real population distributions | [**Nemotron Personas Collection**](https://huggingface.co/collections/nvidia/nemotron-personas) | Nemotron-Personas-USA / India / Japan / Brazil / France / Singapore |
 
92
  | If you want to... | Use this Collection | Start with these datasets |
93
  | :---- | :---- | :---- |
94
  | **FOUNDATION** |  |  |
95
+ | Pre-train a base model | [**Nemotron Pre-Training Collection**](https://huggingface.co/collections/nvidia/nemotron-pre-training-datasets) | [Nemotron-Pretraining-Legal-v1](https://huggingface.co/datasets/nvidia/Nemotron-Pretraining-Legal-v1), [Nemotron-Pretraining-Specialized-v1.2](https://huggingface.co/datasets/nvidia/Nemotron-Pretraining-Specialized-v1.2),[Nemotron-Pretraining-Code-v3](https://huggingface.co/datasets/nvidia/Nemotron-Pretraining-Code-v3) |
96
  | **BUILD A CAPABILITY** |  |  |
97
  | Math reasoning, proofs, and quantitative problem-solving | [**Nemotron Math & Reasoning Collection**](https://huggingface.co/collections/nvidia/nemotron-math-and-reasoning) | Nemotron-SFT-Math-v3, Nemotron-Math-v2, AceReason-Math, Nemotron-CC-Math-v1 |
98
  | Code generation, debugging, and SWE workflows | [**Nemotron Code & SWE Collection**](https://huggingface.co/collections/nvidia/nemotron-code-and-swe) | Nemotron-SFT-Competitive-Programming-v2, Nemotron-SFT-SWE-v2, Nemotron-CC-Code-v1 |
 
107
  | Evaluate model performance | **Nemotron Eval & Benchmark Collection** | SPEED-bench |
108
  | **SPECIALIZED & SOVEREIGN** |  |  |
109
  | Multilingual or domain-specific (e.g. finance) capability | [**Nemotron Supervised Fine-Tuning Collection**](https://huggingface.co/collections/nvidia/nemotron-supervised-fine-tuning) | Nemotron-SFT-Multilingual-v1, Nemotron-SpecializedDomains-Finance-v1 |
110
+ | Diverse synthetic personas grounded in real population distributions | [**Nemotron Personas Collection**](https://huggingface.co/collections/nvidia/nemotron-personas) | Nemotron-Personas-USA / India / Japan / Brazil / France / Singapore / El Salvador / Vietnam |