math datasets for llm
add MuggleMath
  meta-math/MetaMathQA • Updated Dec 21, 2023 • 395k • 60.3k • 458
  Vivacem/MMIQC • Updated Jan 20, 2024 • 2.29M • 283 • 18
  TIGER-Lab/WebInstructSub • Updated Oct 27, 2024 • 2.34M • 1.58k • 161
  AI-MO/NuminaMath-CoT • Updated Nov 25, 2024 • 860k • 51k • 572

low quality datasets for llm
  allenai/tulu-v2-sft-mixture • Updated May 24, 2024 • 326k • 1.63k • 137
  SirNeural/flan_v2 • Updated Feb 24, 2023 • 336M • 1.44k • 199
  lmsys/lmsys-chat-1m • Updated Jul 27, 2024 • 1M • 8.57k • 880
  allenai/WildChat-1M • Updated Oct 17, 2024 • 838k • 14.2k • 432

code datasets for llm
  ise-uiuc/Magicoder-OSS-Instruct-75K • Updated Dec 4, 2023 • 75.2k • 13.9k • 163
  ise-uiuc/Magicoder-Evol-Instruct-110K • Updated Dec 28, 2023 • 111k • 13.1k • 177
  nickrosh/Evol-Instruct-Code-80k-v1 • Updated Jul 11, 2023 • 78.3k • 3.15k • 248
  m-a-p/Code-Feedback • Updated Feb 26, 2024 • 66.4k • 7.06k • 237

high quality datasets for llm
dialogs of general domains with high quality.
  WizardLMTeam/WizardLM_evol_instruct_V2_196k • Updated Mar 10, 2024 • 143k • 4.38k • 249
  MaziyarPanahi/WizardLM_evol_instruct_V2_196k • Updated Apr 23, 2024 • 286k • 364 • 53
  HuggingFaceH4/ultrachat_200k • Updated Oct 16, 2024 • 515k • 63.7k • 700
  Open-Orca/SlimOrca-Dedup • Updated May 19, 2025 • 363k • 4.46k • 91