hate speech detector

by AuricErgeson - opened May 30

Owner May 30

I trained a hate speech detector that catches coded language.

Most existing models miss phrases like "they control the media" or
"heil hitler" because they were trained on explicit slurs only.

I fused 4 datasets (Davidson, ImplicitHate, HateXplain, HateDay 2025)
and added targeted augmentation for neo-Nazi codes, antisemitic dog whistles,
and white nationalist phrases.
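
For anyone wondering what "fusing" datasets can look like in practice, here is a rough sketch, not the actual pipeline used for this model. It assumes each source is a list of (text, label) pairs with its own label names, maps them onto one binary schema, and drops duplicate texts. The function names, source names, and label maps are all placeholders made up for illustration; the real datasets have their own schemas.

```python
def normalize(text):
    """Lowercase and collapse whitespace so near-identical texts compare equal."""
    return " ".join(text.lower().split())


def fuse(sources, label_maps):
    """Merge several labeled sources into one list with a shared binary label.

    sources:    {source_name: [(text, original_label), ...]}
    label_maps: {source_name: {original_label: 0 or 1}}  (hypothetical maps)
    Rows with an unmapped label or an already-seen text are skipped.
    """
    seen = set()
    fused = []
    for name, rows in sources.items():
        mapping = label_maps[name]
        for text, label in rows:
            key = normalize(text)
            if key in seen or label not in mapping:
                continue
            seen.add(key)
            fused.append({"text": text, "label": mapping[label], "source": name})
    return fused


# Toy data with neutral placeholder text; the labels are illustrative only.
sources = {
    "source_a": [("Example  One", "hate"), ("example two", "unmapped")],
    "source_b": [("example one", "toxic"), ("example three", "none")],
}
label_maps = {
    "source_a": {"hate": 1, "normal": 0},
    "source_b": {"toxic": 1, "none": 0},
}
fused = fuse(sources, label_maps)
```

Keeping the `source` field makes it possible to check per-dataset performance later, since label definitions differ between sources even after mapping.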

Results on 11K held-out examples:

One thing it still misses: bare "1488" as a standalone token.
If you've solved this, open an issue. I'm curious.
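
One possible stopgap, offered as a sketch and not as something the model does: run a small rule-based check next to the classifier that looks for the number as a standalone token. The watchlist, the function names, and the 0.5 threshold are assumptions for illustration. Only "1488" comes from this post.

```python
import re

# Hypothetical watchlist; extend with care, since numbers are ambiguous.
CODED_TOKENS = {"1488"}

# A run of digits that is not glued to other letters, digits, or underscores.
DIGIT_RUN = re.compile(r"(?<!\w)\d+(?!\w)")


def flag_coded_tokens(text):
    """Return watchlisted numeric tokens that appear as standalone tokens."""
    return [tok for tok in DIGIT_RUN.findall(text) if tok in CODED_TOKENS]


def should_flag(text, model_score, threshold=0.5):
    """Flag if the classifier score passes the threshold or a rule matches.

    model_score is assumed to be the classifier's hate probability in [0, 1].
    """
    return model_score >= threshold or bool(flag_coded_tokens(text))
```

This will also fire on innocent uses such as a year or a price, so the rule hit is probably better treated as a signal for human review or as an extra feature than as a final decision.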

#NLP #HateSpeechDetection #ContentModeration #TextClassification
