nvidia/Nemotron-Post-Training-Dataset-v1
Viewer
• Updated
• 25.7M • 10.5k • 175
The SFT datasets for KORMo-10B were collected from diverse, publicly available source
Note SFT datasets Englsih - nvidia/Nemotron-Post-Training-Dataset-v1 (~2.8B) - HuggingFaceTB/smoltalk2 (~259.5M) Korean - kormo-lm/reasoning_ko_filter_0710 (3.37B) English & Korean - kormo-lm/KORMo-SFT-datasets (~175M)