Why always qwen miss arabic even though 430 million people speak Arabic

by yousef1727 - opened Dec 24, 2025

Dec 24, 2025

Even though 430M speak Arabic, Qwen struggles because training data is mostly English, Arabic is morphologically complex, has many dialects, and tokenization often breaks words.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment