ujjawalsah's picture
Update README.md
f594cd0 verified
---
datasets:
- google/jigsaw_toxicity_pred
language:
- en
metrics:
- accuracy
---
# Multi-Label Hate Speech Classifier
## Overview
The **Multi-Label Hate Speech Classifier** is a machine learning model designed to detect and categorize multiple forms of hate speech within textual data. It leverages a OneVsRest Logistic Regression classifier combined with TF-IDF vectorization to analyze and classify text into multiple labels simultaneously.
## Features
- **Multi-Label Detection:** Assigns multiple hate speech categories to a single piece of text.
- **Supported Categories:**
- **toxic**
- **obscene**
- **insult**
- **threat**
- **identity_hate**
- **Custom Thresholds:** Optimized thresholds are applied to each label to balance precision and recall.
## Model Architecture
- **Text Vectorization:** Utilizes TF-IDF (Term Frequency-Inverse Document Frequency) to convert raw text into a numerical format.
- **Classifier:** Implements a OneVsRest Logistic Regression approach for multi-label classification.
- **Training Process:** Trained on a balanced dataset with pre-processed text to achieve robust performance across all categories.
## Setup & Installation
### Requirements
- Python 3.x
- Dependencies:
- `numpy`
- `pandas`
- `scikit-learn`
- `joblib`