File size: 3,517 Bytes

---
library_name: transformers
pipeline_tag: text-classification
tags:
- hate-speech
- arabic
- classification
- bert
- social-media
- moderation
language:
- ar
license: mit
datasets:
- IbrahimAmin/egyptian-arabic-hate-speech
metrics:
- accuracy
- f1
widget:
- text: هذا نص عربي للاختبار
base_model:
- CAMeL-Lab/bert-base-arabic-camelbert-da-sentiment
---

# Model Card for hossam87/bert-base-arabic-hate-speech

A fine-tuned BERT model to classify Arabic text into: Neutral, Offensive, Sexism, Religious Discrimination, or Racism.

---

## Model Details

### Model Description

This model is based on `bert-base-multilingual-cased` and fine-tuned on an Arabic social media dataset for hate speech detection.  
It classifies Arabic text into one of five categories: Neutral, Offensive, Sexism, Religious Discrimination, or Racism.  
Intended uses include moderation, analytics, and academic research.

- **Developed by:** [hossam87](https://huggingface.co/hossam87)
- **Model type:** Sequence classification (BERT)
- **Language(s):** Arabic (ar)
- **License:** MIT
- **Finetuned from model:** [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased)

### Model Sources

- **Repository:** [https://huggingface.co/hossam87/bert-base-arabic-hate-speech](https://huggingface.co/hossam87/bert-base-arabic-hate-speech)
- **Demo:** [https://huggingface.co/spaces/hossam87/arabic-hate-speech-detector](https://huggingface.co/spaces/hossam87/arabic-hate-speech-detector)

## Training Details

### Training Data

The model was fine-tuned on a labeled dataset of Arabic social media posts, manually annotated for the five target categories.

### Training Procedure

- **Precision:** Mixed precision (`fp16`)
- **Epochs:** 4 (best model at epoch 3)
- **Batch size:** 32
- **Learning rate:** 3e-5
- **Optimizer:** AdamW
- **Hardware:** 2 x NVIDIA T4 GPUs (Kaggle)

---

## Evaluation

### Metrics

| Metric   | Score  |
|----------|:------:|
| Accuracy | 0.944  |
| F1 Macro | 0.946  |


## Uses

### Direct Use

- Content moderation for Arabic social media, forums, and chats.
- Analytics and research into hate speech patterns in Arabic.
- Educational and academic projects.

### Out-of-Scope Use

- Automated moderation without human oversight in sensitive or legal contexts.
- Use on languages other than Arabic.
- General text classification tasks outside hate speech detection.

## Bias, Risks, and Limitations

The model may misclassify:
- Sarcasm, slang, or context-dependent expressions.
- Formal written Arabic, since trained on social media content.
- Domain-specific or emerging hate speech not represented in the training data.

### Recommendations

Always keep a human-in-the-loop for sensitive moderation tasks. Use responsibly and be transparent about automation.

## How to Get Started with the Model

```python
from transformers import AutoTokenizer, AutoModelForSequenceClassification, pipeline

model_id = "hossam87/bert-base-arabic-hate-speech"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSequenceClassification.from_pretrained(model_id)

classifier = pipeline("text-classification", model=model, tokenizer=tokenizer)

text = "هذا نص عربي للاختبار"
result = classifier(text)
print(result)
```


```bibtex
@misc{hossam87_2025_arabichate,
  title = {BERT-base Arabic Hate Speech Detector},
  author = {Hossam87},
  year = {2025},
  howpublished = {\url{https://huggingface.co/hossam87/bert-base-arabic-hate-speech}},
}
```