File size: 4,621 Bytes
bae4925 00ab3ee bae4925 00ab3ee bae4925 00ab3ee bae4925 00ab3ee |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 |
---
title: Multilingual Hate Speech Detector
emoji: π‘οΈ
colorFrom: red
colorTo: blue
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
license: mit
short_description: Hate speech detector
models:
- xlm-roberta-base
datasets:
- hate-speech
---
# π‘οΈ Multilingual Hate Speech Detector
**Advanced AI system for detecting hate speech in English and Serbian text with innovative contextual analysis**
## π¬ Key Innovations
### 1. **Contextual Analysis** π
- **Word-level importance highlighting** using transformer attention weights
- Visual explanation showing which words most influenced the classification decision
- Color-coded highlighting: π΄ Red (high influence) β π Orange β π‘ Yellow β βͺ Gray (low influence)
### 2. **Confidence Visualization** π
- Interactive Plotly charts showing model confidence across **all 8 categories**
- Real-time confidence distribution analysis
- Color-coded bars distinguishing hate speech categories from appropriate content
### 3. **Interactive Feedback System** π¬
- User rating system (1-5 stars) for continuous model improvement
- Feedback collection for enhancing accuracy
- Community-driven model refinement
## π Hate Speech Categories
The system detects 8 categories:
- **Race**: Racial discrimination and slurs
- **Sexual Orientation**: Homophobic content, LGBTQ+ discrimination
- **Gender**: Sexist content, misogyny, gender-based harassment
- **Physical Appearance**: Body shaming, lookism, appearance-based harassment
- **Religion**: Religious discrimination, islamophobia, antisemitism
- **Class**: Classist content, economic discrimination
- **Disability**: Ableist content, discrimination against disabled people
- **Appropriate**: Non-hateful, normal conversation
## π Multilingual Support
- **English**: Comprehensive hate speech detection
- **Serbian**: Native Serbian language support with Cyrillic and Latin scripts
- **Cross-lingual**: XLM-RoBERTa architecture enables robust multilingual understanding
## π§ Technical Architecture
- **Base Model**: XLM-RoBERTa (Cross-lingual Language Model)
- **Training**: Fine-tuned on multilingual hate speech datasets
- **Attention Mechanism**: Transformer attention weights for explainable AI
- **Real-time Processing**: Optimized for instant classification
- **GPU Acceleration**: CUDA support for faster inference
## π How to Use
1. **Input Text**: Enter any text in English or Serbian
2. **Analyze**: Click "Analyze Text" for instant classification
3. **Review Results**: See category prediction with confidence score
4. **Examine Context**: Check word-level highlighting to understand the decision
5. **View Confidence**: Analyze the confidence distribution chart
6. **Provide Feedback**: Rate the analysis to help improve the model
## π― Example Analyses
### Appropriate Content
```
"I really enjoyed that movie last night! Great acting and storyline."
β β
Appropriate (95% confidence)
```
### Hate Speech Detection
```
"You people are all the same, always causing problems everywhere."
β β οΈ Race (87% confidence)
```
### Serbian Language
```
"Ovaj film je bio odliΔan, preporuΔujem svima!"
β β
Appropriate (92% confidence)
```
## β‘ Performance
- **Accuracy**: High-confidence predictions with detailed explanations
- **Speed**: Real-time processing (< 2 seconds per analysis)
- **Languages**: English and Serbian with cross-lingual capabilities
- **Explainability**: Visual attention analysis for transparent decisions
## π οΈ Local Development
```bash
# Clone the repository
git clone <repository-url>
cd hate-speech-detector
# Install dependencies
pip install -r requirements.txt
# Run the application
python app.py
```
## π Research & Education
This AI system is designed for:
- **Research purposes**: Understanding hate speech patterns
- **Educational use**: Learning about AI explainability
- **Content moderation**: Assisting human moderators
- **Linguistic analysis**: Cross-lingual hate speech research
## β οΈ Important Notes
- Results should be interpreted carefully
- Human judgment should always be applied for critical decisions
- The system is designed to assist, not replace, human moderation
- Continuous improvement through user feedback
## π€ Contributing
We welcome feedback and contributions! Please use the interactive feedback system within the application to help improve model accuracy.
## π License
MIT License - See LICENSE file for details
---
**β‘ Powered by**: Transformer Neural Networks | **π Languages**: English, Serbian | **π― Focus**: Explainable AI
|