T0KII
/

MASRI-RF

Model card Files Files and versions

MASRI-RF / README.md

T0KII's picture

Create README.md

c91a9f0 verified 10 days ago

|

history blame contribute delete

1.28 kB

	# MASRI-RF-XGB: The Egyptian Arabic Ensemble Judge

	MASRI-RF-XGB is the conductor of the Kalamna committee. It is an XGBoost meta-learner that takes the probability outputs from both MASRIHEADS (Transformer) and BIHEADS (RNN) to make a final, highly robust prediction.

	## Stacking Strategy
	The Maestro doesn't just look at the predictions; it analyzes the consensus and conflict between models. It uses a 39-dimensional meta-feature vector per sample:
	1. MASRIHEADS Probs (13 dims): Emotion(8), Sentiment(3), Sarcasm(2).
	2. BIHEADS Probs (13 dims): Emotion(8), Sentiment(3), Sarcasm(2).
	3. Deltas (13 dims): The absolute difference $\|Masri - Bi\|$ for every class.

	## Components
	- Primary Expert: `T0KII/MASRIHEADS`
	- Secondary Expert: `T0KII/BIHEADS`
	- Conductor: XGBoost Classifier

	## Final Performance (Validation Splits)
	\| Task \| Macro-F1 \| Data Split \|
	\|-----------\|----------\|------------\|
	\| Emotion \| 0.9072 \| emotone_ar val \|
	\| Sarcasm \| 0.7304 \| ar_sarcasm committee val \|
	\| Sentiment \| 0.7692 \| multi-source sentiment val \|


	## Use Case
	Use this model when you need the highest possible accuracy for Egyptian dialect analysis, particularly in cases of heavy slang or subtle sarcasm where single models may struggle.