hossam87
/

bert-base-arabic-hate-speech

Text Classification

text-embeddings-inference

Model card Files Files and versions

bert-base-arabic-hate-speech / README.md

hossam87's picture

Update README.md

ae367d7 verified 7 months ago

|

history blame contribute delete

3.52 kB

	---
	library_name: transformers
	pipeline_tag: text-classification
	tags:
	- hate-speech
	- arabic
	- classification
	- bert
	- social-media
	- moderation
	language:
	- ar
	license: mit
	datasets:
	- IbrahimAmin/egyptian-arabic-hate-speech
	metrics:
	- accuracy
	- f1
	widget:
	- text: هذا نص عربي للاختبار
	base_model:
	- CAMeL-Lab/bert-base-arabic-camelbert-da-sentiment
	---

	# Model Card for hossam87/bert-base-arabic-hate-speech

	A fine-tuned BERT model to classify Arabic text into: Neutral, Offensive, Sexism, Religious Discrimination, or Racism.

	---

	## Model Details

	### Model Description

	This model is based on `bert-base-multilingual-cased` and fine-tuned on an Arabic social media dataset for hate speech detection.
	It classifies Arabic text into one of five categories: Neutral, Offensive, Sexism, Religious Discrimination, or Racism.
	Intended uses include moderation, analytics, and academic research.

	- Developed by: [hossam87](https://huggingface.co/hossam87)
	- Model type: Sequence classification (BERT)
	- Language(s): Arabic (ar)
	- License: MIT
	- Finetuned from model: [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased)

	### Model Sources

	- Repository: [https://huggingface.co/hossam87/bert-base-arabic-hate-speech](https://huggingface.co/hossam87/bert-base-arabic-hate-speech)
	- Demo: [https://huggingface.co/spaces/hossam87/arabic-hate-speech-detector](https://huggingface.co/spaces/hossam87/arabic-hate-speech-detector)

	## Training Details

	### Training Data

	The model was fine-tuned on a labeled dataset of Arabic social media posts, manually annotated for the five target categories.

	### Training Procedure

	- Precision: Mixed precision (`fp16`)
	- Epochs: 4 (best model at epoch 3)
	- Batch size: 32
	- Learning rate: 3e-5
	- Optimizer: AdamW
	- Hardware: 2 x NVIDIA T4 GPUs (Kaggle)

	---

	## Evaluation

	### Metrics

	\| Metric \| Score \|
	\|----------\|:------:\|
	\| Accuracy \| 0.944 \|
	\| F1 Macro \| 0.946 \|


	## Uses

	### Direct Use

	- Content moderation for Arabic social media, forums, and chats.
	- Analytics and research into hate speech patterns in Arabic.
	- Educational and academic projects.

	### Out-of-Scope Use

	- Automated moderation without human oversight in sensitive or legal contexts.
	- Use on languages other than Arabic.
	- General text classification tasks outside hate speech detection.

	## Bias, Risks, and Limitations

	The model may misclassify:
	- Sarcasm, slang, or context-dependent expressions.
	- Formal written Arabic, since trained on social media content.
	- Domain-specific or emerging hate speech not represented in the training data.

	### Recommendations

	Always keep a human-in-the-loop for sensitive moderation tasks. Use responsibly and be transparent about automation.

	## How to Get Started with the Model

	```python
	from transformers import AutoTokenizer, AutoModelForSequenceClassification, pipeline

	model_id = "hossam87/bert-base-arabic-hate-speech"
	tokenizer = AutoTokenizer.from_pretrained(model_id)
	model = AutoModelForSequenceClassification.from_pretrained(model_id)

	classifier = pipeline("text-classification", model=model, tokenizer=tokenizer)

	text = "هذا نص عربي للاختبار"
	result = classifier(text)
	print(result)
	```


	```bibtex
	@misc{hossam87_2025_arabichate,
	title = {BERT-base Arabic Hate Speech Detector},
	author = {Hossam87},
	year = {2025},
	howpublished = {\url{https://huggingface.co/hossam87/bert-base-arabic-hate-speech}},
	}
	```