File size: 1,073 Bytes
37b0fb5
 
 
 
 
 
c1121fb
 
 
 
37b0fb5
 
 
 
c1121fb
 
 
 
37b0fb5
 
c1121fb
 
 
 
 
 
 
 
 
 
 
 
 
 
37b0fb5
 
c1121fb
2e23556
c1121fb
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
---
language: ar
base_model: faisalq/SaudiBERT
tags:
- arabic
- saudi
- eou
- turn-taking
- conversational-ai
license: mit
---

# Saudi Arabic End-of-Utterance (EOU) Model

This model detects **End-of-Utterance (EOU)** events in **Saudi Arabic conversational text**.  
It outputs the probability that a speaker has **finished their turn**, enabling natural turn-taking in real-time voice agents (e.g., LiveKit).

---

## Task
Binary classification (probability output):

- **0** → Incomplete utterance (speaker likely to continue)
- **1** → Complete utterance (end of turn)

---

## Model Details
- **Base model:** `faisalq/SaudiBERT`
- **Architecture:** BERT Sequence Classification
- **Output:** Single probability (sigmoid)
- **Dialect focus:** Saudi Arabic (ar-SA)

---

## Training
- **Dataset:** Saudi Arabic conversational EOU dataset  
  https://huggingface.co/datasets/HussainKAUST/saudi-data-eou.jsonl
- **Data source:** Synthetic Saudi dialogue with natural pauses and incomplete turns
- **Loss:** Focal Loss (class imbalance handling)
- **Epochs:** 6

---