Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 49 items • Updated about 7 hours ago • 153
nvidia/Nemotron-RL-Instruction-Following-Citation-Formatting-v1 Viewer • Updated 4 days ago • 9.54k • 42 • 2
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 49 items • Updated about 7 hours ago • 153
nvidia/Nemotron-RL-Instruction-Following-Structured-Outputs-v2 Viewer • Updated 4 days ago • 62.7k • 210 • 3
nvidia/Nemotron-RL-Instruction-Following-Structured-Outputs-v2 Viewer • Updated 4 days ago • 62.7k • 210 • 3
nvidia/Nemotron-RL-Instruction-Following-Free-Form-Formatting-v1 Viewer • Updated 4 days ago • 9.04k • 41 • 2
PivotRL: High Accuracy Agentic Post-Training at Low Compute Cost Paper • 2603.21383 • Published Mar 22 • 18
PivotRL: High Accuracy Agentic Post-Training at Low Compute Cost Paper • 2603.21383 • Published Mar 22 • 18