Arabic-English translation benchmark: MSA vs dialectal performance

#10
by O96a - opened

Ran a quick benchmark on OPUS-MT Arabic-English with 9 test cases across formal, technical, and dialectal inputs.
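For anyone wanting to reproduce this, the timing loop is straightforward. A minimal sketch (the `benchmark` helper and the stub below are mine, not the exact harness used; in the real run `translate_fn` would wrap the `Helsinki-NLP/opus-mt-ar-en` checkpoint via the `transformers` translation pipeline):

```python
import time

def benchmark(translate_fn, test_cases):
    """Time each translation and collect (source, output, latency) triples.

    translate_fn is any callable str -> str. With transformers it would be
    something like:
        from transformers import pipeline
        mt = pipeline("translation", model="Helsinki-NLP/opus-mt-ar-en")
        translate_fn = lambda s: mt(s)[0]["translation_text"]
    """
    results = []
    for source in test_cases:
        start = time.perf_counter()
        output = translate_fn(source)
        latency = time.perf_counter() - start
        results.append({"source": source, "output": output, "latency_s": latency})
    return results
```

Swapping in a different model (e.g. for an NLLB-200 comparison) only means changing `translate_fn`.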

Key findings:

  • MSA (Modern Standard Arabic): Strong performance, 3–14s latency
  • Technical content: Handles ML/API terminology well, preserves code-switching
  • Dialectal Arabic: Significant truncation. Egyptian "إزيك؟ كله تمام؟" ("How are you? All good?") was reduced to "I was gonna ask you something", missing the greeting entirely
  • Sudanese "يا زول" (roughly "hey, man") came out as "Hey, Zol", transliterating the term instead of translating it
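The truncation cases can be flagged cheaply with a length-ratio check between source and output. A sketch (the 0.5 threshold is my rough guess, not a tuned value):

```python
def looks_truncated(source: str, output: str, min_ratio: float = 0.5) -> bool:
    """Flag translations suspiciously short relative to their source.

    English renderings of Arabic are usually comparable in character
    length, so an output far shorter than the source hints at dropped
    content. min_ratio=0.5 is an arbitrary assumption to tune on real data.
    """
    if not source:
        return False
    return len(output) / len(source) < min_ratio
```

Running this over the benchmark outputs would surface the dialectal failures without manual inspection.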

Latency range: 0.4s (simple inputs) to 13.5s (technical sentences)

For production Arabic NLP pipelines, OPUS-MT works well for MSA out of the box, but a dialect-to-MSA preprocessing step before translation would improve dialectal coverage.
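The simplest form of that preprocessing is a lexicon-based rewrite of known dialectal phrases into MSA before the text hits the model. A sketch, where the two mappings are illustrative examples I'm adding, not a vetted dialect lexicon:

```python
# Hypothetical dialect-to-MSA lexicon; entries here are illustrative only.
DIALECT_TO_MSA = {
    "إزيك": "كيف حالك",  # Egyptian "how are you" -> MSA
    "يا زول": "يا رجل",   # Sudanese "hey man" -> MSA equivalent
}

def normalize_dialect(text: str) -> str:
    """Replace known dialectal phrases with MSA equivalents.

    Longest phrases are applied first so multi-word entries win
    over single-word substrings.
    """
    for phrase in sorted(DIALECT_TO_MSA, key=len, reverse=True):
        text = text.replace(phrase, DIALECT_TO_MSA[phrase])
    return text
```

A real pipeline would want morphology-aware matching rather than plain substring replacement, but even a small lexicon like this would have rescued both failure cases above.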

Has anyone tested this against NLLB-200 for Arabic dialect coverage?
