Cursed Toxic Pretraining Corpora
mavinsao/reddit-mental-illness-82
• Updated • 52.6k • 117
• 4
• Updated • 2.17k • 106
• 3
RentonWEB3/reddit_dataset_193
• Updated • 110k • 31
• 1
Updated • 881
• 9
hugginglearners/reddit-depression-cleaned
• Updated • 7.73k • 327
• 1
chloeliu/reddit_nosleep_posts
• Updated • 610 • 14
• 1
• Updated • 4.41k • 71
• 40
gmongaras/reddit_negative
• Updated • 4.86k • 11
• Updated • 213k • 5
• 2
yoonholee/reddit_TwoSentencePlotTwist_1575
• Updated • 1.58k • 5
ve-nk-at/reddit_comment_violation_data_set
• Updated • 2.03k • 6
• Updated • 11.9k • 25
• 3
DuckyBlender/racist-dataset
• Updated • 1.31k • 16
• 6
taylorgordon/antisemitism_weak_labeling
• Updated • 3.15k • 30
JoshMcGiff/HomophobiaDetectionTwitterX
• Updated • 1.28k • 18
• 3
SetFit/hate_speech_offensive
• Updated • 24.8k • 259
• 2
badmatr11x/hate-offensive-speech
• Updated • 56.7k • 118
• 5
Intuit-GenSRF/hate-speech-offensive
• Updated • 24.8k • 6
ctoraman/gender-hate-speech
• Updated • 20k • 77
• 3
fuzzy-g/4chan_pol_whole_ds
• Updated • 4.09M • 8
• 1
Skorcht/inceldatabaseTHISWASASUGGESTION
• Updated • 2.49k • 8
• 1
• Updated • 144k • 16
• 5
HuggingFaceTB/SmolVLM-Instruct
Image-Text-to-Text
• 2B • Updated • 29.3k
• 588
MultiverseComputingCAI/LittleLamb
Text Generation
• 0.3B • Updated • 3.83k
• 8