|
|
--- |
|
|
datasets: |
|
|
- google/jigsaw_toxicity_pred |
|
|
language: |
|
|
- en |
|
|
metrics: |
|
|
- accuracy |
|
|
--- |
|
|
# Multi-Label Hate Speech Classifier |
|
|
|
|
|
## Overview |
|
|
The **Multi-Label Hate Speech Classifier** is a machine learning model designed to detect and categorize multiple forms of hate speech within textual data. It leverages a OneVsRest Logistic Regression classifier combined with TF-IDF vectorization to analyze and classify text into multiple labels simultaneously. |
|
|
|
|
|
## Features |
|
|
- **Multi-Label Detection:** Assigns multiple hate speech categories to a single piece of text. |
|
|
- **Supported Categories:** |
|
|
- **toxic** |
|
|
- **obscene** |
|
|
- **insult** |
|
|
- **threat** |
|
|
- **identity_hate** |
|
|
- **Custom Thresholds:** Optimized thresholds are applied to each label to balance precision and recall. |
|
|
|
|
|
## Model Architecture |
|
|
- **Text Vectorization:** Utilizes TF-IDF (Term Frequency-Inverse Document Frequency) to convert raw text into a numerical format. |
|
|
- **Classifier:** Implements a OneVsRest Logistic Regression approach for multi-label classification. |
|
|
- **Training Process:** Trained on a balanced dataset with pre-processed text to achieve robust performance across all categories. |
|
|
|
|
|
## Setup & Installation |
|
|
|
|
|
### Requirements |
|
|
- Python 3.x |
|
|
- Dependencies: |
|
|
- `numpy` |
|
|
- `pandas` |
|
|
- `scikit-learn` |
|
|
- `joblib` |