normal-datasets Zyphra/Zyda-2 Preview • Updated Aug 6, 2025 • 128k • 85 bigcode/the-stack-v2 Viewer • Updated Apr 23, 2024 • 5.45B • 5.41k • 447 iamtarun/python_code_instructions_18k_alpaca Viewer • Updated Jul 27, 2023 • 18.6k • 6.46k • 321 allenai/olmo-mix-1124 Viewer • Updated Aug 19, 2025 • 621M • 21.1k • 86
Instruct data nickrosh/Evol-Instruct-Code-80k-v1 Viewer • Updated Jul 11, 2023 • 78.3k • 1.6k • 246 HuggingFaceTB/smollm-corpus Viewer • Updated Sep 6, 2024 • 237M • 22.4k • 417 HuggingFaceTB/cosmopedia Viewer • Updated Aug 12, 2024 • 31.1M • 39.2k • 659
normal-datasets Zyphra/Zyda-2 Preview • Updated Aug 6, 2025 • 128k • 85 bigcode/the-stack-v2 Viewer • Updated Apr 23, 2024 • 5.45B • 5.41k • 447 iamtarun/python_code_instructions_18k_alpaca Viewer • Updated Jul 27, 2023 • 18.6k • 6.46k • 321 allenai/olmo-mix-1124 Viewer • Updated Aug 19, 2025 • 621M • 21.1k • 86
Instruct data nickrosh/Evol-Instruct-Code-80k-v1 Viewer • Updated Jul 11, 2023 • 78.3k • 1.6k • 246 HuggingFaceTB/smollm-corpus Viewer • Updated Sep 6, 2024 • 237M • 22.4k • 417 HuggingFaceTB/cosmopedia Viewer • Updated Aug 12, 2024 • 31.1M • 39.2k • 659