ูKhaliji_Qwen-3.5: The First Khaliji Dialect LLM
๐ Introduction
Welcome to the future of Levantine AI.
We are thrilled to introduce Khaliji_Qwen-3.5, the first series of Large Language Models specifically fine-tuned to understand and generate the Khaliji Arabic Dialect.
While most Arabic LLMs focus on Modern Standard Arabic (MSA/Fusha), they often fail to capture the nuance, warmth, and cultural specificity of local dialects. We changed that. By leveraging the powerful Qwen 3.5 architecture, we have fine-tuned this model not just to "speak Arabic," but to speak Khaliji.
From the streets of Egypt, this model understands the local idioms, slang, and cultural context that define Khaliji communication.
๐ Key Features
- ๐ฃ๏ธ Native Dialect: Trained specifically on Khaliji colloquial data, not just MSA.
- ๐ง Smart & Small: Built on efficient Qwen small-model architecture for fast inference.
- ๐ค Community First: Open weights with a strong commitment to the open-source ecosystem.
๐ ๏ธ How to Load & Use
Getting started with Khaliji_Qwen-3.5 is seamless. You can load it using the standard transformers library.
Requirements
pip install transformers torch accelerate
Inference Code
from transformers import AutoModelForCausalLM, AutoTokenizer
model_name = "theBOrg32/Khaliji_qwen_3.5_2B"
# Load tokenizer and model
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
model_name,
device_map="auto",
trust_remote_code=True,
torch_dtype="auto"
)
# Prepare your Khaliji Dialect prompt
prompt = "ุฅุฒูู ูุง ููุฏู
ุ"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
# Generate
outputs = model.generate(
**inputs,
max_new_tokens=256,
temperature=0.5,
do_sample=True
)
response = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(response)
๐ฌ Examples & Benchmarks
We tested Khaliji_Qwen-3.5 against general-purpose models. Notice how our model captures the tone and dialect while others revert to formal Arabic or English.
| Prompt (Khaliji Dialect) | Khaliji_Qwen-3.5 (Ours) | OpenAI | Gemini |
|---|---|---|---|
| ู ุงุฐุง ุชุนุฑู ุนู ุงูู ูุทู | ุฃูุง ู ุณุงุนุฏ ุฐูุงุก ุตูุงุนู ุ ูู ุง ุฃูุฏุฑ ุฃุชุญูู ู ู ูุงูููุงู ุ ุจุณ ุนูู ุญุณุจ ุงูุจูุงูุงุช ุงูู ุชููุฑุฉุ ุงูู ูุทู ูู ุฌุฒุก ู ู ุงูุฑูุงุถูุงุช ูุงูููุณูุฉุ ููุนุทูู ุทุฑู ุชููุฑ ูุชุชุตุฑู ุจุดูู ู ูุทูู. ูุงููู ูุณู ูููู ุงูู ูุทู ุงูุฑูุงุถู ูุณู ููู ุจุนุฏ ุงูู ูุทู ุงูููุงุณูููุ ููู ูุนุชุจุฑ ุฃุณุงุณ ููุฑูุงุถูุงุช ุงูุญุฏูุซุฉ. ูุจุฑุถู ุจุนุฏ ูุณู ูููู ุงูู ูุทู ุงูุฑู ุฒู ุฃู ุงูู ูุทู ุงูุฑูุงุถูุ ููู ููุน ู ู ุงูุฑูุงุถูุงุช ูุฑูุฒ ุนูู ุงุณุชุฎุฏุงู ุงูุฑู ูุฒ ูุงูุฑู ูุฒ ุงูุฑูุงุถูุฉ ุนุดุงู ููุตููู ูููุฑุฉ ู ุนูููุฉ. ูุงูู ูุถูุน ุจุนุฏ ูู ุนูุงูุฉ ุจุงูุจุฑู ุฌุฉ ูุชุทููุฑ ุงูุจุฑู ุฌูุงุชุ ูุฃู ููู ูู ุทุฑููุฉ ุชุณุชุฎุฏู ุงูู ูุทู ุนุดุงู ุชุญุณูู ุฃุฏุงุก ุงูุจุฑูุงู ุฌ. ูุจุฑุถู ุจุนุฏ ูู ุนูุงูุฉ ุจุนูู ุงูููุณุ ูุฃู ููู ูุงุณ ูุณู ูููู ุงูู ูุทู ุงูููุณูุ ููุฑุฌุนูู ููู ููุฌูุฉ ุงูุนูู ูุฉ ุนุดุงู ููุตููู ูููู ุฃุนู ู ูุณููู ุงูุฅูุณุงู. | ุงูู ู ูุทู ูู ูุฑุน ู ู ูุฑูุน ุงูููุณูุฉ ูุงูุฑูุงุถูุงุช ููุชู ุจุฏุฑุงุณุฉ ููุงุนุฏ ุงูุชูููุฑ ุงูุตุญูุญ ูุงูุงุณุชุฏูุงู. ูุนูู ุจุจุณุงุทุฉ: ููู ูู ููุฒ ุจูู ุงูุญุฌุฉ ุงูุตุญูุญุฉ ูุงูุญุฌุฉ ุงูุฎุงุทุฆุฉุ ูููู ูุจูู ุงุณุชูุชุงุฌุงุช ุณููู ุฉ ู ู ู ุนููู ุงุช ู ูุฌูุฏุฉ. ๐ง ู ุง ุงูุฐู ูุฏุฑุณู ุงูู ูุทูุ ุงูู ูุทู ููุชู ุจุฃุดูุงุก ู ุซู: ุงูุงุณุชุฏูุงู: ููู ูุตู ู ู ู ูุฏู ุงุช ุฅูู ูุชูุฌุฉ | ุงูู ูุทู ูู "ุขูุฉ ุนูููุฉ" ุชุนุตู ุงูุฐูู ู ู ุงููููุน ูู ุงูุฎุทุฃุ ุฃู ุจุจุณุงุทุฉ ูู ุนูู ุฏุฑุงุณุฉ ููุงุนุฏ ุงูุชูููุฑ ุงูุณููู ูููููุฉ ุงุณุชุฎูุงุต ุงููุชุงุฆุฌ ู ู ุงูู ูุฏู ุงุช. Wikipedia Wikipedia +1 ุฅููู ุฃูู ู ุง ูู ูุฒู ุจุงุฎุชุตุงุฑ: ุงูุชุนุฑูู ุงูููุณูู: ุงุนุชุจุฑู ุงุจู ุณููุง ูู "ุงูู ูุฒุงู" ุงูุฐู ูู ูุฒ ุงูุญู ู ู ุงูุจุงุทูุ ููุณุจุชู ููู ุนุงูู ู ุซู ูุณุจุฉ ุงููุญู ููููุงู . |
Note: the full response is too long for all models so we trimmed, the idea here is just to show that even the SOTA models cannot handle the dialect.
โ๏ธ License & Commercial Use
We are strong believers in the Open Source Community. To ensure this technology remains accessible and beneficial to everyone, we have chosen a Copyleft License.
๐ License: CC-BY-SA-4.0
This model is released under the Creative Commons Attribution-ShareAlike 4.0 International License.
๐ค Usage Guidelines
- โ Open Source Projects: You are free to use, fine-tune, and distribute this model in your projects, provided your project also remains open-source and references Khaliji_Qwen-3.5.
- โ Commercial Use: Commercial usage is allowed under the terms of CC-BY-SA-4.0 (your derivative models must remain open).
- ๐ Closed Source / Proprietary: If you wish to integrate this model (or a fine-tuned version) into a closed-source product without releasing your weights/code, you must obtain prior approval.
๐ง For Closed-Source Licensing: Please contact us at info2@the-borg.ru to discuss agreements that respect our open-source mission.
๐ Credits & Acknowledgments
This model would not be possible without the foundational work of the Qwen Team at Alibaba Cloud. We stand on the shoulders of giants.
- Base Model: Qwen 3.5
- Fine-Tuning & Alignment: The Borg Organization
- Dataset: Curated Syrian Dialect Corpus
Citation
If you use Khaliji_Qwen-3.5 in your research or project, please cite us:
@misc{Khaliji_qwen_2026,
title={Khaliji_Qwen-3.5: The First Khaliji Dialect Large Language Model},
author={The Borg Organization},
year={2026},
license={CC-BY-SA-4.0}
}
Built with โค๏ธ for the Khaliji Community & The World
Preserving language, one token at a time.
- Downloads last month
- 7