Datasets
databricks/databricks-dolly-15k • 15k • 36.3k • 986
togethercomputer/RedPajama-Data-1T • 1.73M • 2.05k • 1.17k
mteb/arxiv-clustering-s2s • 31 • 2.54k • 1
mteb/amazon_reviews_multi • 2.52M • 2.34k • 29
mteb/toxic_conversations_50k • 100k • 2.28k • 19
mteb/tweet_sentiment_extraction • 30.2k • 4.41k • 38
mteb/sts22-crosslingual-sts • 17.2k • 10.3k • 14
mteb/stackoverflowdupquestions-reranking • 22.8k • 990 • 3
reach-vb/jenny_tts_dataset • 21k • 357 • 36
ai4privacy/pii-masking-200k • 209k • 3.31k • 123
ai4privacy/pii-masking-300k • 225k • 4.59k • 106
bigcode/bigcode-pii-dataset-training • 11.9k • 26 • 11
TypicaAI/pii-masking-60k_fr • 61.9k • 66 • 2
davanstrien/code-prompt-similarity-model • Sentence Similarity • 0.1B • 10 • 6