common-dataset
updated
HuggingFaceH4/ultrachat_200k
Viewer
• Updated • 515k • 65.4k
• 703
Text Generation
• 7B • Updated • 5.26k
• 322
shareAI/ShareGPT-Chinese-English-90k
Preview
• Updated • 1.61k
• 280
Viewer
• Updated • 207M • 30.6k
• 504
lmsys/chatbot_arena_conversations
Viewer
• Updated • 33k • 85.8k
• 459
Viewer
• Updated • 968M • 22.3k
• 911
WizardLMTeam/WizardLM_evol_instruct_70k
Viewer
• Updated • 70k • 2.78k
• 196
LargeWorldModel/LWM-Text-Chat-1M
Text Generation
• Updated • 1.81k
• 173
Updated • 2.29k
• 123
microsoft/orca-math-word-problems-200k
Viewer
• Updated • 200k • 10.2k
• 479
Preview
• Updated • 232
• 27
Viewer
• Updated • 52.5B • 637k
• 2.79k
Yukang/LongAlpaca-16k-length
Viewer
• Updated • 6.28k • 112
• 25
Viewer
• Updated • 51.8k • 33.4k
• 821
Viewer
• Updated • 343M • 802
• 11
NousResearch/json-mode-eval
Viewer
• Updated • 100 • 665
• 44
NousResearch/func-calling-eval-singleturn
Viewer
• Updated • 112 • 38
• 8
NousResearch/func-calling-eval-glaive
Viewer
• Updated • 100 • 46
• 9
legacy-datasets/wikipedia
Updated • 119k
• 628
Viewer
• Updated • 10.4B • 796k
• 570
open-web-math/open-web-math
Viewer
• Updated • 6.32M • 40.3k
• 339
codeparrot/github-code-clean
Viewer
• Updated • 11M • 15.7k
• 137
HuggingFaceFW/fineweb-edu-score-2
Viewer
• Updated • 13.9B • 64.2k
• 86
HuggingFaceFW/fineweb-edu
Viewer
• Updated • 3.5B • 561k
• 1.07k
Viewer
• Updated • 52k • 90.6k
• 958
Viewer
• Updated • 772k • 150
• 27
YeungNLP/WizardLM_evol_instruct_V2_143k
Viewer
• Updated • 143k • 24
• 11
Viewer
• Updated • 2.94M • 52.7k
• 1.53k
WizardLMTeam/WizardLM_evol_instruct_V2_196k
Viewer
• Updated • 143k • 4.31k
• 249
timdettmers/openassistant-guanaco
Viewer
• Updated • 10.4k • 7.75k
• 441
garage-bAInd/Open-Platypus
Viewer
• Updated • 24.9k • 8.5k
• 416
Viewer
• Updated • 3.71M • 1.32M
• 682
Updated • 271
• 225
Salesforce/xlam-function-calling-60k
Viewer
• Updated • 60k • 15.1k
• 612
HuggingFaceTB/smollm-corpus
Viewer
• Updated • 237M • 57.4k
• 453
glaiveai/glaive-function-calling-v2
Viewer
• Updated • 113k • 38.5k
• 501
mlfoundations/dclm-baseline-1.0-parquet
Viewer
• Updated • 2.73B • 237k
• 38
mlfoundations/dclm-baseline-1.0
Preview
• Updated • 228k
• 266
ruslanmv/ai-medical-chatbot
Viewer
• Updated • 257k • 1.68k
• 246
Viewer
• Updated • 100k • 10.5k
• 267
Viewer
• Updated • 69.9k • 65.8k
• 399
xzuyn/manythings-translations-alpaca
Viewer
• Updated • 6.33M • 102
• 8
Viewer
• Updated • 21.9M • 3.96k
• 716
Viewer
• Updated • 1.75M • 850
• 105
mlabonne/open-perfectblend
Viewer
• Updated • 1.42M • 1.6k
• 72
mlabonne/orca-agentinstruct-1M-v1-cleaned
Viewer
• Updated • 1.05M • 195
• 67
allenai/tulu-3-sft-mixture
Viewer
• Updated • 939k • 16.5k
• 240
NovaSky-AI/Sky-T1_data_17k
Viewer
• Updated • 16.4k • 582
• 186
Viewer
• Updated • 552M • 1.21k
• 3
Viewer
• Updated • 78.1M • 381
• 6
Viewer
• Updated • 1.13M • 718
• 12
Viewer
• Updated • 16.2M • 291
• 1
Viewer
• Updated • 172k • 19
• 2
Viewer
• Updated • 62.3k • 47
• 2
Viewer
• Updated • 72.1k • 27
• 1
lianghsun/tw-instruct-500k
Viewer
• Updated • 500k • 365
• 26