saudi-eou-model / README.md
HussainKAUST's picture
Update README.md
2e23556 verified
---
language: ar
base_model: faisalq/SaudiBERT
tags:
- arabic
- saudi
- eou
- turn-taking
- conversational-ai
license: mit
---
# Saudi Arabic End-of-Utterance (EOU) Model
This model detects **End-of-Utterance (EOU)** events in **Saudi Arabic conversational text**.
It outputs the probability that a speaker has **finished their turn**, enabling natural turn-taking in real-time voice agents (e.g., LiveKit).
---
## Task
Binary classification (probability output):
- **0** → Incomplete utterance (speaker likely to continue)
- **1** → Complete utterance (end of turn)
---
## Model Details
- **Base model:** `faisalq/SaudiBERT`
- **Architecture:** BERT Sequence Classification
- **Output:** Single probability (sigmoid)
- **Dialect focus:** Saudi Arabic (ar-SA)
---
## Training
- **Dataset:** Saudi Arabic conversational EOU dataset
https://huggingface.co/datasets/HussainKAUST/saudi-data-eou.jsonl
- **Data source:** Synthetic Saudi dialogue with natural pauses and incomplete turns
- **Loss:** Focal Loss (class imbalance handling)
- **Epochs:** 6
---