Update README.md
#3
by WillJenningsDC - opened
README.md CHANGED
@@ -92,7 +92,7 @@ Every model NVIDIA ships rests on a data layer — and that data shapes how the
 | If you want to... | Use this Collection | Start with these datasets |
 | :---- | :---- | :---- |
 | **FOUNDATION** | | |
-| Pre-train a base model | [**Nemotron Pre-Training Collection**](https://huggingface.co/collections/nvidia/nemotron-pre-training-datasets) | Nemotron-
+| Pre-train a base model | [**Nemotron Pre-Training Collection**](https://huggingface.co/collections/nvidia/nemotron-pre-training-datasets) | [Nemotron-Pretraining-Legal-v1](https://huggingface.co/datasets/nvidia/Nemotron-Pretraining-Legal-v1), [Nemotron-Pretraining-Specialized-v1.2](https://huggingface.co/datasets/nvidia/Nemotron-Pretraining-Specialized-v1.2),[Nemotron-Pretraining-Code-v3](https://huggingface.co/datasets/nvidia/Nemotron-Pretraining-Code-v3) |
 | **BUILD A CAPABILITY** | | |
 | Math reasoning, proofs, and quantitative problem-solving | [**Nemotron Math & Reasoning Collection**](https://huggingface.co/collections/nvidia/nemotron-math-and-reasoning) | Nemotron-SFT-Math-v3, Nemotron-Math-v2, AceReason-Math, Nemotron-CC-Math-v1 |
 | Code generation, debugging, and SWE workflows | [**Nemotron Code & SWE Collection**](https://huggingface.co/collections/nvidia/nemotron-code-and-swe) | Nemotron-SFT-Competitive-Programming-v2, Nemotron-SFT-SWE-v2, Nemotron-CC-Code-v1 |
@@ -107,4 +107,4 @@ Every model NVIDIA ships rests on a data layer — and that data shapes how the
 | Evaluate model performance | **Nemotron Eval & Benchmark Collection** | SPEED-bench |
 | **SPECIALIZED & SOVEREIGN** | | |
 | Multilingual or domain-specific (e.g. finance) capability | [**Nemotron Supervised Fine-Tuning Collection**](https://huggingface.co/collections/nvidia/nemotron-supervised-fine-tuning) | Nemotron-SFT-Multilingual-v1, Nemotron-SpecializedDomains-Finance-v1 |
-| Diverse synthetic personas grounded in real population distributions | [**Nemotron Personas Collection**](https://huggingface.co/collections/nvidia/nemotron-personas) | Nemotron-Personas-USA / India / Japan / Brazil / France / Singapore |
+| Diverse synthetic personas grounded in real population distributions | [**Nemotron Personas Collection**](https://huggingface.co/collections/nvidia/nemotron-personas) | Nemotron-Personas-USA / India / Japan / Brazil / France / Singapore / El Salvador / Vietnam |