| | --- |
| | language: ar |
| | tags: |
| | - arabic |
| | - saudi-dialect |
| | - eou-detection |
| | - livekit |
| | - voice-agent |
| | - real-time |
| | --- |
| | |
| | # Arabic EOU Model โ Saudi Dialect (raghad23/arabic-eou-model90) |
| |
|
| | **High-performance** End-of-Utterance (EOU) detection model for real-time Arabic voice agents (LiveKit, etc.). |
| |
|
| | ### Task |
| | Binary classification: |
| | - `LABEL_0` โ Speaker continues |
| | - `LABEL_1` โ End of turn (EOU) |
| |
|
| | ### Performance (Test Set) |
| | | Metric | Score | |
| | |--------------------|---------| |
| | | Accuracy | **90.68%** | |
| | | Weighted F1 | **93.40%** | |
| | | Precision (EOU) | 99.83% | |
| | | Recall (EOU) | 87.75% | |
| | | Eval Loss | 0.1964 | |
| |
|
| | **Excellent real-time performance** โ detects end-of-utterance accurately in Saudi Arabic dialect. |
| |
|
| | ### Dataset |
| | 414k Saudi dialect samples |
| | https://huggingface.co/datasets/raghad23/arabic_eou_sada_dataset |
| | |
| | ### Base Model |
| | `aubmindlab/bert-base-arabertv02` |
| | |
| | ### Latency |
| | < 80ms โ perfect for LiveKit deployment |
| | |
| | ### Usage |
| | ```python |
| | from transformers import pipeline |
| | pipe = pipeline("text-classification", model="raghad23/arabic-eou-model90") |
| | pipe("ุชู
ุงู
ุงูุญู
ุฏููู ูุดูููู") # โ LABEL_1 = EOU |
| | ``` |
| | |
| | |