Pre-training datasets 0xDing/wikipedia-cn-20230720-filtered Viewer • Updated Jul 23, 2023 • 255k • 2.3k • 171 Skywork/SkyPile-150B Viewer • Updated Dec 7, 2023 • 1.76M • 10.6k • 407
Supervised fine-tuning datasets BelleGroup/train_2M_CN Viewer • Updated Apr 8, 2023 • 2M • 1.84k • 110 BelleGroup/train_0.5M_CN Viewer • Updated Apr 3, 2023 • 519k • 2.11k • 122 BelleGroup/generated_chat_0.4M Viewer • Updated Apr 8, 2023 • 396k • 476 • 69 BelleGroup/school_math_0.25M Viewer • Updated Apr 8, 2023 • 248k • 287 • 105
Pre-training datasets 0xDing/wikipedia-cn-20230720-filtered Viewer • Updated Jul 23, 2023 • 255k • 2.3k • 171 Skywork/SkyPile-150B Viewer • Updated Dec 7, 2023 • 1.76M • 10.6k • 407
Supervised fine-tuning datasets BelleGroup/train_2M_CN Viewer • Updated Apr 8, 2023 • 2M • 1.84k • 110 BelleGroup/train_0.5M_CN Viewer • Updated Apr 3, 2023 • 519k • 2.11k • 122 BelleGroup/generated_chat_0.4M Viewer • Updated Apr 8, 2023 • 396k • 476 • 69 BelleGroup/school_math_0.25M Viewer • Updated Apr 8, 2023 • 248k • 287 • 105