math datasets for llm
add MuggleMath
meta-math/MetaMathQA • Updated Dec 21, 2023 • 395k • 46.4k • 463
Vivacem/MMIQC • Updated Jan 20, 2024 • 2.29M • 113 • 18
TIGER-Lab/WebInstructSub • Updated Oct 27, 2024 • 2.34M • 1.42k • 162
AI-MO/NuminaMath-CoT • Updated Nov 25, 2024 • 860k • 38.2k • 592

low quality datasets for llm
allenai/tulu-v2-sft-mixture • Updated May 24, 2024 • 326k • 989 • 138
SirNeural/flan_v2 • Updated Feb 24, 2023 • 336M • 753 • 200
lmsys/lmsys-chat-1m • Updated Jul 27, 2024 • 1M • 7.03k • 925
allenai/WildChat-1M • Updated Oct 17, 2024 • 838k • 30.3k • 441

code datasets for llm
ise-uiuc/Magicoder-OSS-Instruct-75K • Updated Dec 4, 2023 • 75.2k • 37.2k • 167
ise-uiuc/Magicoder-Evol-Instruct-110K • Updated Dec 28, 2023 • 111k • 22.8k • 180
nickrosh/Evol-Instruct-Code-80k-v1 • Updated Jul 11, 2023 • 78.3k • 5.28k • 249
m-a-p/Code-Feedback • Updated Feb 26, 2024 • 66.4k • 5.6k • 241

high quality datasets for llm
High-quality dialogs across general domains.
WizardLMTeam/WizardLM_evol_instruct_V2_196k • Updated Mar 10, 2024 • 143k • 2.18k • 250
MaziyarPanahi/WizardLM_evol_instruct_V2_196k • Updated Apr 23, 2024 • 286k • 50 • 53
HuggingFaceH4/ultrachat_200k • Updated Oct 16, 2024 • 515k • 57.3k • 737
Open-Orca/SlimOrca-Dedup • Updated May 19, 2025 • 363k • 1.63k • 93