MASRI-RF / README.md
T0KII's picture
Create README.md
c91a9f0 verified

MASRI-RF-XGB: The Egyptian Arabic Ensemble Judge

MASRI-RF-XGB is the conductor of the Kalamna committee. It is an XGBoost meta-learner that takes the probability outputs from both MASRIHEADS (Transformer) and BIHEADS (RNN) to make a final, highly robust prediction.

Stacking Strategy

The Maestro doesn't just look at the predictions; it analyzes the consensus and conflict between models. It uses a 39-dimensional meta-feature vector per sample:

  1. MASRIHEADS Probs (13 dims): Emotion(8), Sentiment(3), Sarcasm(2).
  2. BIHEADS Probs (13 dims): Emotion(8), Sentiment(3), Sarcasm(2).
  3. Deltas (13 dims): The absolute difference $|Masri - Bi|$ for every class.

Components

  • Primary Expert: T0KII/MASRIHEADS
  • Secondary Expert: T0KII/BIHEADS
  • Conductor: XGBoost Classifier

Final Performance (Validation Splits)

Task Macro-F1 Data Split
Emotion 0.9072 emotone_ar val
Sarcasm 0.7304 ar_sarcasm committee val
Sentiment 0.7692 multi-source sentiment val

Use Case

Use this model when you need the highest possible accuracy for Egyptian dialect analysis, particularly in cases of heavy slang or subtle sarcasm where single models may struggle.