HussainKAUST commited on
Commit
37b0fb5
·
verified ·
1 Parent(s): 5ce3767

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +35 -3
README.md CHANGED
@@ -1,3 +1,35 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: ar
3
+ base_model: faisalq/SaudiBERT
4
+ tags:
5
+ - eou
6
+ - turn-taking
7
+ - arabic
8
+ - saudi
9
+ ---
10
+
11
+ # Saudi Arabic End-of-Utterance (EOU) Model
12
+
13
+ This is a fine-tuned **SaudiBERT** model for **End-of-Utterance (EOU) detection** in Saudi Arabic conversational text.
14
+
15
+ ## Task
16
+ Binary classification:
17
+ - 0 → Incomplete utterance
18
+ - 1 → End of utterance
19
+
20
+ ## Training
21
+ - Base model: faisalq/SaudiBERT
22
+ - Data: Saudi Arabic conversational dataset
23
+ - Loss: Focal Loss
24
+ - Metric: F1-score
25
+
26
+ ## Usage
27
+ ```python
28
+ from transformers import AutoTokenizer, AutoModelForSequenceClassification
29
+ import torch
30
+
31
+ tok = AutoTokenizer.from_pretrained("HussainKAUST/saudi-eou-model")
32
+ mdl = AutoModelForSequenceClassification.from_pretrained("HussainKAUST/saudi-eou-model")
33
+
34
+ x = tok("ابي احجز موعد بس ...", return_tensors="pt")
35
+ p = torch.sigmoid(mdl(**x).logits).item()