ujjawalsah
/

hate-speech-multi-label-classifier

Model card Files Files and versions

hate-speech-multi-label-classifier / README.md

ujjawalsah's picture

Update README.md

f594cd0 verified 11 months ago

|

history blame contribute delete

1.29 kB

	---
	datasets:
	- google/jigsaw_toxicity_pred
	language:
	- en
	metrics:
	- accuracy
	---
	# Multi-Label Hate Speech Classifier

	## Overview
	The Multi-Label Hate Speech Classifier is a machine learning model designed to detect and categorize multiple forms of hate speech within textual data. It leverages a OneVsRest Logistic Regression classifier combined with TF-IDF vectorization to analyze and classify text into multiple labels simultaneously.

	## Features
	- Multi-Label Detection: Assigns multiple hate speech categories to a single piece of text.
	- Supported Categories:
	- toxic
	- obscene
	- insult
	- threat
	- identity_hate
	- Custom Thresholds: Optimized thresholds are applied to each label to balance precision and recall.

	## Model Architecture
	- Text Vectorization: Utilizes TF-IDF (Term Frequency-Inverse Document Frequency) to convert raw text into a numerical format.
	- Classifier: Implements a OneVsRest Logistic Regression approach for multi-label classification.
	- Training Process: Trained on a balanced dataset with pre-processed text to achieve robust performance across all categories.

	## Setup & Installation

	### Requirements
	- Python 3.x
	- Dependencies:
	- `numpy`
	- `pandas`
	- `scikit-learn`
	- `joblib`