common-dataset
HuggingFaceH4/ultrachat_200k • Viewer • 515k • 35.2k • 663
Text Generation • 7B • 5.08k • 315
shareAI/ShareGPT-Chinese-English-90k • Preview • 612 • 278
Viewer • 207M • 19.8k • 487
lmsys/chatbot_arena_conversations • Viewer • 33k • 1.93k • 445
Viewer • 968M • 14.5k • 892
WizardLMTeam/WizardLM_evol_instruct_70k • Viewer • 70k • 1.46k • 197
LargeWorldModel/LWM-Text-Chat-1M • Text Generation • 284 • 174
973 • 122
microsoft/orca-math-word-problems-200k • Viewer • 200k • 5.54k • 476
Preview • 221 • 27
Viewer • 52.5B • 157k • 2.69k
Yukang/LongAlpaca-16k-length • Viewer • 6.28k • 30 • 25
Viewer • 51.8k • 21.7k • 795
Viewer • 343M • 533 • 10
NousResearch/json-mode-eval • Viewer • 100 • 819 • 41
NousResearch/func-calling-eval-singleturn • Viewer • 112 • 10 • 7
NousResearch/func-calling-eval-glaive • Viewer • 100 • 20 • 8
legacy-datasets/wikipedia • 60.3k • 611
Viewer • 10.4B • 532k • 530
open-web-math/open-web-math • Viewer • 6.32M • 11.7k • 330
codeparrot/github-code-clean • Viewer • 11M • 17.1k • 135
HuggingFaceFW/fineweb-edu-score-2 • Viewer • 13.9B • 9.56k • 85
HuggingFaceFW/fineweb-edu • Viewer • 3.5B • 222k • 984
Viewer • 52k • 63.3k • 926
Viewer • 772k • 57 • 26
YeungNLP/WizardLM_evol_instruct_V2_143k • Viewer • 143k • 17 • 11
Viewer • 2.94M • 16.8k • 1.5k
WizardLMTeam/WizardLM_evol_instruct_V2_196k • Viewer • 143k • 2.57k • 246
timdettmers/openassistant-guanaco • Viewer • 10.4k • 6.52k • 440
garage-bAInd/Open-Platypus • Viewer • 24.9k • 7.69k • 415
Viewer • 3.71M • 916k • 640
226 • 224
Salesforce/xlam-function-calling-60k • Viewer • 60k • 6.35k • 578
HuggingFaceTB/smollm-corpus • Viewer • 237M • 33.9k • 442
glaiveai/glaive-function-calling-v2 • Viewer • 113k • 8.02k • 492
mlfoundations/dclm-baseline-1.0-parquet • Viewer • 2.73B • 8.08k • 35
mlfoundations/dclm-baseline-1.0 • Preview • 129k • 256
ruslanmv/ai-medical-chatbot • Viewer • 257k • 1.29k • 245
Viewer • 100k • 7.49k • 265
Viewer • 69.9k • 120k • 386
xzuyn/manythings-translations-alpaca • Viewer • 6.33M • 28 • 8
Viewer • 21.9M • 1.54k • 699
Viewer • 1.75M • 124 • 104
mlabonne/open-perfectblend • Viewer • 1.42M • 869 • 67
mlabonne/orca-agentinstruct-1M-v1-cleaned • Viewer • 1.05M • 89 • 66
allenai/tulu-3-sft-mixture • Viewer • 939k • 16.7k • 229
NovaSky-AI/Sky-T1_data_17k • Viewer • 16.4k • 267 • 187
Viewer • 552M • 111 • 2
Viewer • 78.1M • 373 • 5
Viewer • 1.13M • 755 • 10
Viewer • 16.2M • 348 • 1
Viewer • 172k • 65 • 2
Viewer • 62.3k • 72 • 2
Viewer • 72.1k • 53 • 1
lianghsun/tw-instruct-500k • Viewer • 500k • 74 • 24