Nemotron Agentic & Tool-Use Collection Datasets for building models capable of function calling, multi-step agentic tasks, terminal use, and SWE workflows. • 11 items • Updated 17 days ago • 11
view article Article From Scarcity to Scale: How Synthetic Personas Can Bootstrap Japanese AI Development nvidia • Feb 19 • 3
view article Article Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining nvidia • 24 days ago • 17
Nemotron Vision-Language Collection Image-text paired datasets for building vision-language models (VLMs). • 3 items • Updated 17 days ago • 8
view article Article The First Healthcare Robotics Dataset and Foundational Physical AI Models for Healthcare Robotics nvidia • Mar 16 • 31
view article Article The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator nvidia • Dec 17, 2025 • 50
view article Article Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding nvidia • Mar 19 • 47
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 50 items • Updated 17 days ago • 168
Nemotron-Pre-Training-Datasets Collection Large scale pre-training datasets used in the Nemotron family of models. • 15 items • Updated 17 days ago • 173
Nemotron-Personas Collection A collection of multilingual, region-specific synthetic persona datasets that support sovereign AI development across many countries and regions. • 10 items • Updated 11 days ago • 56
Nemotron RAG Collection Set of tools to build retrieval-augmented generation (RAG) systems, improve search and ranking accuracy, and extract structured data from complex docs • 11 items • Updated 17 days ago • 93
Physical AI Collection Collection of open, commercial-grade datasets for physical AI developers • 53 items • Updated 17 days ago • 166
view article Article Nemotron-Personas-India: Synthesized Data for Sovereign AI nvidia • Oct 13, 2025 • 14