MASRIHEADS: Multi-Task Egyptian Arabic BERT

MASRIHEADS is a multi-task transformer model for the Egyptian Arabic dialect (Masri). It classifies sarcasm, sentiment, and emotion simultaneously, using a shared backbone with one specialized classification head per task.

Model Details

  • Base Model: T0KII/MASRIBERTv2
  • Architecture: BERT-based Multi-Task Classifier
  • Max Sequence Length: 128
  • Language: Egyptian Arabic (arz)

Task Heads & Labels

The model is equipped with three classification heads:

  • Sarcasm (2 classes): not sarcastic, sarcastic
  • Sentiment (3 classes): negative, neutral, positive
  • Emotion (8 classes): none, anger, joy, sadness, love, sympathy, surprise, fear
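
As a minimal sketch of how the three heads' outputs could be turned into labels, the snippet below applies a softmax and an argmax to each head independently. It assumes that each head's class indices follow the order listed above and that the heads return plain lists of logits keyed by task name. Neither assumption is confirmed by this card, and `LABELS`, `softmax`, and `decode` are hypothetical helper names, not part of the released model.

```python
import math

# Assumption: class indices follow the order in which the labels are listed above.
LABELS = {
    "sarcasm": ["not sarcastic", "sarcastic"],
    "sentiment": ["negative", "neutral", "positive"],
    "emotion": ["none", "anger", "joy", "sadness",
                "love", "sympathy", "surprise", "fear"],
}


def softmax(logits):
    """Convert raw scores to probabilities, shifting by the max for stability."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def decode(head_logits):
    """Map {task: logits} to {task: (label, probability)} via argmax per head."""
    result = {}
    for task, logits in head_logits.items():
        names = LABELS[task]
        if len(logits) != len(names):
            raise ValueError(
                f"{task}: expected {len(names)} logits, got {len(logits)}"
            )
        probs = softmax(logits)
        best = max(range(len(probs)), key=probs.__getitem__)
        result[task] = (names[best], probs[best])
    return result


# Made-up logits for a single sentence, purely for illustration.
example = {
    "sarcasm": [0.2, 1.5],
    "sentiment": [2.0, 0.1, -1.0],
    "emotion": [0.0, 3.0, -1.0, 0.5, -2.0, -0.5, 0.1, 0.2],
}
predictions = decode(example)
for task, (label, prob) in predictions.items():
    print(f"{task}: {label} ({prob:.2f})")
```

Because each head is decoded on its own, a sentence can receive any combination of labels, for example sarcastic, negative, and anger at once.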

Training Data

The model was trained on a consolidated dataset of Egyptian dialect text, including:

  • ar_sarcasm
  • ArSarcasm_v2
  • iSarcasmEval
  • egyptian-arabic-fake-reviews
  • egyptian_sentiment_analysis_dataset
  • emotone_ar

Performance (Test Set F1)

Task        Macro-F1
Sarcasm     0.7882
Sentiment   0.7671
Emotion     0.7649
Mean        0.7734
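
For readers unfamiliar with the metric, the sketch below shows one common way to compute macro-F1 (the unweighted mean of per-class F1) and checks that the Mean row is the arithmetic mean of the three task scores. The `macro_f1` helper and its toy inputs are illustrative assumptions; the evaluation code actually used for this model is not described here, and conventions for classes with no predictions may differ.

```python
def macro_f1(y_true, y_pred, num_classes):
    """Unweighted mean of per-class F1.

    Convention assumed here: a class with no true positives, false
    positives, or false negatives contributes an F1 of 0.
    """
    scores = []
    for c in range(num_classes):
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == c and p == c)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != c and p == c)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t == c and p != c)
        denom = 2 * tp + fp + fn
        # F1 = 2*TP / (2*TP + FP + FN), equivalent to the harmonic mean
        # of precision and recall.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / num_classes


# Toy binary example (made-up labels, not model output).
toy_score = macro_f1([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2)

# Scores reported above; the mean is taken over tasks, not over classes.
reported = {"Sarcasm": 0.7882, "Sentiment": 0.7671, "Emotion": 0.7649}
mean_f1 = sum(reported.values()) / len(reported)
print(round(mean_f1, 4))  # 0.7734
```

Macro averaging weights every class equally, so performance on rare classes counts as much as performance on frequent ones.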