common-dataset
updated
HuggingFaceH4/ultrachat_200k
Viewer
•
Updated
•
515k
•
28.3k
•
632
Text Generation
•
7B
•
Updated
•
4.81k
•
307
shareAI/ShareGPT-Chinese-English-90k
Preview
•
Updated
•
1.88k
•
272
Viewer
•
Updated
•
207M
•
15.3k
•
470
lmsys/chatbot_arena_conversations
Viewer
•
Updated
•
33k
•
1.5k
•
432
Viewer
•
Updated
•
968M
•
46.9k
•
881
WizardLMTeam/WizardLM_evol_instruct_70k
Viewer
•
Updated
•
70k
•
557
•
195
LargeWorldModel/LWM-Text-Chat-1M
Text Generation
•
Updated
•
1.52k
•
174
Updated
•
993
•
120
microsoft/orca-math-word-problems-200k
Viewer
•
Updated
•
200k
•
9k
•
466
Preview
•
Updated
•
67
•
27
Viewer
•
Updated
•
52.5B
•
178k
•
2.58k
Yukang/LongAlpaca-16k-length
Viewer
•
Updated
•
6.28k
•
63
•
25
Viewer
•
Updated
•
51.8k
•
25.7k
•
752
Viewer
•
Updated
•
343M
•
487
•
10
NousResearch/json-mode-eval
Viewer
•
Updated
•
100
•
525
•
40
NousResearch/func-calling-eval-singleturn
Viewer
•
Updated
•
112
•
28
•
7
NousResearch/func-calling-eval-glaive
Viewer
•
Updated
•
100
•
39
•
8
legacy-datasets/wikipedia
Updated
•
55.1k
•
608
Viewer
•
Updated
•
10.4B
•
632k
•
498
open-web-math/open-web-math
Viewer
•
Updated
•
6.32M
•
8.8k
•
324
codeparrot/github-code-clean
Viewer
•
Updated
•
11M
•
17.7k
•
132
HuggingFaceFW/fineweb-edu-score-2
Viewer
•
Updated
•
13.9B
•
26.3k
•
82
HuggingFaceFW/fineweb-edu
Viewer
•
Updated
•
3.5B
•
269k
•
890
Viewer
•
Updated
•
52k
•
50.7k
•
844
Viewer
•
Updated
•
772k
•
46
•
26
YeungNLP/WizardLM_evol_instruct_V2_143k
Viewer
•
Updated
•
143k
•
156
•
11
Viewer
•
Updated
•
2.94M
•
13.8k
•
1.48k
WizardLMTeam/WizardLM_evol_instruct_V2_196k
Viewer
•
Updated
•
143k
•
1.34k
•
246
timdettmers/openassistant-guanaco
Viewer
•
Updated
•
10.4k
•
5.4k
•
437
garage-bAInd/Open-Platypus
Viewer
•
Updated
•
24.9k
•
4.34k
•
412
Viewer
•
Updated
•
3.71M
•
847k
•
545
Updated
•
856
•
224
Salesforce/xlam-function-calling-60k
Viewer
•
Updated
•
60k
•
3.4k
•
559
HuggingFaceTB/smollm-corpus
Viewer
•
Updated
•
237M
•
14.8k
•
408
glaiveai/glaive-function-calling-v2
Viewer
•
Updated
•
113k
•
2.35k
•
478
mlfoundations/dclm-baseline-1.0-parquet
Viewer
•
Updated
•
2.73B
•
5.26k
•
32
mlfoundations/dclm-baseline-1.0
Preview
•
Updated
•
472k
•
250
ruslanmv/ai-medical-chatbot
Viewer
•
Updated
•
257k
•
1.35k
•
245
Viewer
•
Updated
•
100k
•
12.6k
•
256
Viewer
•
Updated
•
470M
•
44.9k
•
321
xzuyn/manythings-translations-alpaca
Viewer
•
Updated
•
6.33M
•
76
•
8
Viewer
•
Updated
•
21.9M
•
12.9k
•
688
Viewer
•
Updated
•
1.75M
•
141
•
103
mlabonne/open-perfectblend
Viewer
•
Updated
•
1.42M
•
492
•
57
mlabonne/orca-agentinstruct-1M-v1-cleaned
Viewer
•
Updated
•
1.05M
•
166
•
63
allenai/tulu-3-sft-mixture
Viewer
•
Updated
•
939k
•
11.1k
•
206
NovaSky-AI/Sky-T1_data_17k
Viewer
•
Updated
•
16.4k
•
283
•
186
Viewer
•
Updated
•
552M
•
1.81k
•
2
Viewer
•
Updated
•
78.1M
•
568
•
5
Viewer
•
Updated
•
1.13M
•
319
•
10
Viewer
•
Updated
•
16.2M
•
920
•
1
Viewer
•
Updated
•
172k
•
304
•
2
Viewer
•
Updated
•
62.3k
•
29
•
2
Viewer
•
Updated
•
72.1k
•
128
•
1
lianghsun/tw-instruct-500k
Viewer
•
Updated
•
500k
•
169
•
23