Pretraining Data

opencsg/Fineweb-Edu-Chinese-V2.1 • Updated Jan 28 • 958M • 15.6k • 73
allenai/dolma3_pool • Updated Feb 24 • 20.4k • 34
nvidia/Nemotron-CC-v2.1 • Updated Dec 22, 2025 • 3.8B • 17.5k • 120
allenai/dolma3_dolmino_pool • Updated Jan 5 • 4.96k • 8