|
|
--- |
|
|
language: |
|
|
- ar |
|
|
- en |
|
|
tags: |
|
|
- chat |
|
|
- translation |
|
|
license: apache-2.0 |
|
|
base_model: OMDA-Decoder |
|
|
pipeline_tag: text-generation |
|
|
--- |
|
|
|
|
|
# OMDA: Bilingual Arabic-English Chat LLM |
|
|
|
|
|
**Model Name:** OMDA |
|
|
**Architecture:** OMDA-Decoder (Custom Architecture) |
|
|
**Tokenizer:** OMDATokenizer (Custom Tokenizer) |
|
|
**Languages:** Arabic (Primary), English |
|
|
**Model Type:** Chat/Instruction-following |
|
|
**Author:** Binomda |
|
|
**Release Date:** 2025-06-28 |
|
|
|
|
|
## Model Overview |
|
|
OMDA is a compact bilingual language model specifically designed for Arabic-English conversational AI applications. Built with a custom decoder architecture, it excels at understanding and generating natural responses in both languages. |
|
|
|
|
|
## Model Specifications |
|
|
| Parameter | Value | |
|
|
|--------------------|-----------| |
|
|
| Layers | 6 | |
|
|
| Hidden Size | 512 | |
|
|
| Attention Heads | 8 | |
|
|
| FFN Dimension | 2048 | |
|
|
| Max Sequence Length| 512 | |
|
|
| Vocabulary Size | 128,004 | |
|
|
| Training Data | 1,000 curated Arabic-English conversation pairs | |
|
|
|
|
|
## Intended Uses |
|
|
โ
Chatbot development |
|
|
โ
Bilingual assistant applications |
|
|
โ
Educational tools |
|
|
โ
Basic translation tasks |
|
|
|
|
|
## Quick Start |
|
|
```python |
|
|
from transformers import pipeline |
|
|
|
|
|
# Initialize chat pipeline |
|
|
chatbot = pipeline("text-generation", model="BINOMDA/OMDA") |
|
|
|
|
|
# Arabic input example |
|
|
ar_responser = chatbot("ู
ุง ูู ุฑุฃูู ูู ุงูุชูููููุฌูุง ุงูุญุฏูุซุฉุ") |
|
|
|
|
|
# English input example |
|
|
en_response = chatbot("Explain artificial intelligence simply") |
|
|
|
|
|
## Intended Use |
|
|
- Chatbots, assistants, translation, and educational tools for Arabic/English. |
|
|
|
|
|
## Training |
|
|
- Trained for 5 epochs on 1000 samples. |
|
|
- Loss curve and checkpoints included. |
|
|
|
|
|
## Limitations |
|
|
- This is a small-scale demonstration model and may not generalize well to all real-world chat scenarios. |
|
|
- Not suitable for production use without further scaling, extensive evaluation, and safety checks. |
|
|
- Limited training data and model size may result in hallucinations or inaccurate translations. |
|
|
- No advanced filtering for inappropriate or biased outputs. |
|
|
- For research and educational purposes only. |
|
|
|
|
|
## Export & Deployment |
|
|
- See below for HuggingFace, llama.cpp, and ollama export |