raghad23 commited on
Commit
dd48de7
·
verified ·
1 Parent(s): ec59257

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +48 -0
README.md ADDED
@@ -0,0 +1,48 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: ar
3
+ tags:
4
+ - arabic
5
+ - saudi-dialect
6
+ - eou-detection
7
+ - livekit
8
+ - voice-agent
9
+ - real-time
10
+ ---
11
+
12
+ # Arabic EOU Model – Saudi Dialect (raghad23/arabic-eou-model90)
13
+
14
+ **High-performance** End-of-Utterance (EOU) detection model for real-time Arabic voice agents (LiveKit, etc.).
15
+
16
+ ### Task
17
+ Binary classification:
18
+ - `LABEL_0` → Speaker continues
19
+ - `LABEL_1` → End of turn (EOU)
20
+
21
+ ### Performance (Test Set)
22
+ | Metric | Score |
23
+ |--------------------|---------|
24
+ | Accuracy | **90.68%** |
25
+ | Weighted F1 | **93.40%** |
26
+ | Precision (EOU) | 99.83% |
27
+ | Recall (EOU) | 87.75% |
28
+ | Eval Loss | 0.1964 |
29
+
30
+ **Excellent real-time performance** — detects end-of-utterance accurately in Saudi Arabic dialect.
31
+
32
+ ### Dataset
33
+ 414k Saudi dialect samples
34
+ https://huggingface.co/datasets/raghad23/arabic_eou_sada_dataset
35
+
36
+ ### Base Model
37
+ `aubmindlab/bert-base-arabertv02`
38
+
39
+ ### Latency
40
+ < 80ms → perfect for LiveKit deployment
41
+
42
+ ### Usage
43
+ ```python
44
+ from transformers import pipeline
45
+ pipe = pipeline("text-classification", model="raghad23/arabic-eou-model90")
46
+ pipe("تمام الحمدلله وشلونك") # → LABEL_1 = EOU
47
+ ```
48
+