math datasets for llm
add MuggleMath
meta-math/MetaMathQA • Updated Dec 21, 2023 • 395k • 8.18k • 432
Vivacem/MMIQC • Updated Jan 20, 2024 • 2.29M • 115 • 18
TIGER-Lab/WebInstructSub • Updated Oct 27, 2024 • 2.34M • 842 • 159
AI-MO/NuminaMath-CoT • Updated Nov 25, 2024 • 860k • 10.3k • 525

low quality datasets for llm
allenai/tulu-v2-sft-mixture • Updated May 24, 2024 • 326k • 924 • 135
SirNeural/flan_v2 • Updated Feb 24, 2023 • 336M • 640 • 197
lmsys/lmsys-chat-1m • Updated Jul 27, 2024 • 1M • 4.56k • 784
allenai/WildChat-1M • Updated Oct 17, 2024 • 838k • 11.8k • 405

code datasets for llm
ise-uiuc/Magicoder-OSS-Instruct-75K • Updated Dec 4, 2023 • 75.2k • 1.75k • 158
ise-uiuc/Magicoder-Evol-Instruct-110K • Updated Dec 28, 2023 • 111k • 3k • 170
nickrosh/Evol-Instruct-Code-80k-v1 • Updated Jul 11, 2023 • 78.3k • 1.59k • 245
m-a-p/Code-Feedback • Updated Feb 26, 2024 • 66.4k • 5.91k • 214

high quality datasets for llm
High-quality dialogs across general domains.
WizardLMTeam/WizardLM_evol_instruct_V2_196k • Updated Mar 10, 2024 • 143k • 1.15k • 246
MaziyarPanahi/WizardLM_evol_instruct_V2_196k • Updated Apr 23, 2024 • 286k • 79 • 53
HuggingFaceH4/ultrachat_200k • Updated Oct 16, 2024 • 515k • 26.1k • 634
Open-Orca/SlimOrca-Dedup • Updated May 19, 2025 • 363k • 15.2k • 90