arabic_eou_model / README.md
raghad23's picture
Create README.md
dd48de7 verified
metadata
language: ar
tags:
  - arabic
  - saudi-dialect
  - eou-detection
  - livekit
  - voice-agent
  - real-time

Arabic EOU Model โ€“ Saudi Dialect (raghad23/arabic-eou-model90)

High-performance End-of-Utterance (EOU) detection model for real-time Arabic voice agents (LiveKit, etc.).

Task

Binary classification:

  • LABEL_0 โ†’ Speaker continues
  • LABEL_1 โ†’ End of turn (EOU)

Performance (Test Set)

Metric Score
Accuracy 90.68%
Weighted F1 93.40%
Precision (EOU) 99.83%
Recall (EOU) 87.75%
Eval Loss 0.1964

Excellent real-time performance โ€” detects end-of-utterance accurately in Saudi Arabic dialect.

Dataset

414k Saudi dialect samples
https://huggingface.co/datasets/raghad23/arabic_eou_sada_dataset

Base Model

aubmindlab/bert-base-arabertv02

Latency

< 80ms โ†’ perfect for LiveKit deployment

Usage

from transformers import pipeline
pipe = pipeline("text-classification", model="raghad23/arabic-eou-model90")
pipe("ุชู…ุงู… ุงู„ุญู…ุฏู„ู„ู‡ ูˆุดู„ูˆู†ูƒ")  # โ†’ LABEL_1 = EOU