--- language: - en - hi license: apache-2.0 tags: - hate-speech-detection - reddit - xlm-roberta - hindi - english datasets: - HASOC2019 metrics: - accuracy - f1 model-index: - name: reddit-hate-speech-detector results: - task: type: text-classification metrics: - type: accuracy value: 0.8293 - type: f1 value: 0.8278 --- # Reddit Hate Speech Detector (Hindi + English) This model detects hate speech in Reddit comments for both Hindi and English languages. ## Model Description - **Base Model:** XLM-RoBERTa - **Languages:** Hindi, English - **Task:** Multi-task classification (hate speech detection + type + target) - **Accuracy:** 82.93% - **F1 Score:** 0.8278 ## Intended Use This model is designed for: - Content moderation on Reddit - Automated hate speech detection - Research purposes ⚠️ **Important:** This model should assist human moderators, not replace them. ## Usage ```python import torch from transformers import XLMRobertaTokenizer # Load tokenizer tokenizer = XLMRobertaTokenizer.from_pretrained('xlm-roberta-base') # Your model loading code here # (See inference script) ``` ## Training Data - HASOC 2019 Hindi Dataset - HASOC 2019 English Dataset - Combined training with class balancing ## Limitations - May have biases present in training data - Requires context for accurate detection - Cultural nuances may not be fully captured ## Ethical Considerations - Should be used transparently - Allow user appeals - Regular monitoring for fairness - Consider cultural context