File size: 1,073 Bytes
37b0fb5 c1121fb 37b0fb5 c1121fb 37b0fb5 c1121fb 37b0fb5 c1121fb 2e23556 c1121fb |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 |
---
language: ar
base_model: faisalq/SaudiBERT
tags:
- arabic
- saudi
- eou
- turn-taking
- conversational-ai
license: mit
---
# Saudi Arabic End-of-Utterance (EOU) Model
This model detects **End-of-Utterance (EOU)** events in **Saudi Arabic conversational text**.
It outputs the probability that a speaker has **finished their turn**, enabling natural turn-taking in real-time voice agents (e.g., LiveKit).
---
## Task
Binary classification (probability output):
- **0** → Incomplete utterance (speaker likely to continue)
- **1** → Complete utterance (end of turn)
---
## Model Details
- **Base model:** `faisalq/SaudiBERT`
- **Architecture:** BERT Sequence Classification
- **Output:** Single probability (sigmoid)
- **Dialect focus:** Saudi Arabic (ar-SA)
---
## Training
- **Dataset:** Saudi Arabic conversational EOU dataset
https://huggingface.co/datasets/HussainKAUST/saudi-data-eou.jsonl
- **Data source:** Synthetic Saudi dialogue with natural pauses and incomplete turns
- **Loss:** Focal Loss (class imbalance handling)
- **Epochs:** 6
---
|