File size: 1,285 Bytes

---
datasets:
- google/jigsaw_toxicity_pred
language:
- en
metrics:
- accuracy
---
# Multi-Label Hate Speech Classifier

## Overview
The **Multi-Label Hate Speech Classifier** is a machine learning model designed to detect and categorize multiple forms of hate speech within textual data. It leverages a OneVsRest Logistic Regression classifier combined with TF-IDF vectorization to analyze and classify text into multiple labels simultaneously.

## Features
- **Multi-Label Detection:** Assigns multiple hate speech categories to a single piece of text.
- **Supported Categories:**
  - **toxic**
  - **obscene**
  - **insult**
  - **threat**
  - **identity_hate**
- **Custom Thresholds:** Optimized thresholds are applied to each label to balance precision and recall.

## Model Architecture
- **Text Vectorization:** Utilizes TF-IDF (Term Frequency-Inverse Document Frequency) to convert raw text into a numerical format.
- **Classifier:** Implements a OneVsRest Logistic Regression approach for multi-label classification.
- **Training Process:** Trained on a balanced dataset with pre-processed text to achieve robust performance across all categories.

## Setup & Installation

### Requirements
- Python 3.x
- Dependencies:
  - `numpy`
  - `pandas`
  - `scikit-learn`
  - `joblib`