horus-egy-coder / README.md
BLACK0X80's picture
Update README.md
ffa9277 verified
|
Raw
History Blame Contribute Delete
4.74 kB
---
language:
- ar
- en
license: apache-2.0
base_model: meta-llama/Llama-3.1-8B
tags:
- llama
- arabic
- egyptian
- coding
- text-generation
- conversational
- unsloth
- trl
- fine-tuned
pipeline_tag: text-generation
library_name: transformers
---
# Horus-Egy-Coder
**Egyptian Arabic coding assistant** — a fine-tuned Llama 3.1-8B model specialized in helping Arabic and Egyptian developers write, debug, and understand code.
> Built in Egypt 🇪🇬 | Powered by [Unsloth](https://github.com/unslothai/unsloth) + TRL
---
## Model Description
`horus-egy-coder` is a fine-tuned version of `meta-llama/Llama-3.1-8B-Instruct` trained to assist Arabic-speaking developers — with native understanding of Egyptian Arabic technical dialect. It can explain code in Arabic, help debug in conversational Egyptian, and write clean code with Arabic comments.
| Property | Value |
|---|---|
| **Base model** | meta-llama/Llama-3.1-8B-Instruct |
| **Parameters** | 1B (quantized BF16) |
| **Training** | Unsloth + HuggingFace TRL (2x faster) |
| **License** | Apache 2.0 |
| **Language** | Arabic 🇪🇬 / English |
| **Task** | Code generation, explanation, debugging |
---
## Quick Start
```python
from transformers import AutoTokenizer, AutoModelForCausalLM
import torch
model_name = "BLACK0X80/horus-egy-coder"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
model_name,
torch_dtype=torch.bfloat16,
device_map="auto"
)
messages = [
{
"role": "system",
"content": "أنت مساعد برمجة بيتكلم عربي مصري. بتشرح الكود بالعربي وبتساعد المطورين."
},
{
"role": "user",
"content": "اكتبلي function بـ Python بتحسب الـ factorial"
}
]
input_ids = tokenizer.apply_chat_template(
messages,
tokenize=True,
add_generation_prompt=True,
return_tensors="pt"
).to(model.device)
output = model.generate(
input_ids,
max_new_tokens=512,
temperature=0.7,
do_sample=True
)
print(tokenizer.decode(output[0][input_ids.shape[1]:], skip_special_tokens=True))
```
---
## Example Interactions
**Example 1 — Code explanation in Arabic:**
```
User: فسرلي الكود ده
for i in range(len(arr)-1):
for j in range(len(arr)-i-1):
if arr[j] > arr[j+1]:
arr[j], arr[j+1] = arr[j+1], arr[j]
Model: ده Bubble Sort — خوارزمية ترتيب بسيطة.
بتعمل loop على الـ array وبتقارن كل عنصر بالي بعده،
لو العنصر الأكبر جه الأول بتبدلهم...
```
**Example 2 — Debugging:**
```
User: الكود ده بيطلع IndexError، مش عارف ليه
Model: المشكلة في السطر ده... جرب كذا...
```
---
🔢 GGUF Version (للأجهزة المحدودة)
لو جهازك مش عنده GPU كبير، استخدم النسخة المضغوطة:
[`BLACK0X80/horus-egy-coder-Q4_K_M-GGUF`](https://huggingface.co/BLACK0X80/horus-egy-coder-Q4_K_M-GGUF)
```bash
# تشغيل بـ llama.cpp
./llama-cli -m horus-egy-coder-Q4_K_M.gguf \
-p "أنت مساعد برمجة عربي مصري." \
--chat-format llama-3
```
---
## Training Details
- **Framework:** [Unsloth](https://github.com/unslothai/unsloth) (2x faster training, 60% less memory)
- **Library:** HuggingFace TRL (SFT Trainer)
- **Base:** `unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit`
- **Quantization:** BF16
---
## Intended Use
**Good for:**
- شرح الكود بالعربي المصري
- كتابة functions وscripts بـ Python
- Debugging مع تفسير الأخطاء بالعربي
- تعليم البرمجة للمبتدئين العرب
**Limitations:**
- النموذج fine-tuned على بيانات محدودة — ممكن يغلط في كود معقد جداً
- مش متخصص في لغات غير Python بنفس الكفاءة
- مش مناسب لـ production code بدون مراجعة بشرية
---
## About HORUS-AI
HORUS-AI is an Egyptian AI initiative focused on building Arabic and Egyptian-dialect AI models. Named after the ancient Egyptian god of knowledge, we aim to make AI accessible to Arabic-speaking developers.
- GitHub: [BLACK0X80](https://github.com/BLACK0X80)
- Web: [black0x80.vercel.app](https://black0x80.vercel.app)
---
## Citation
```bibtex
@misc{horus-egy-coder-2025,
title={Horus-Egy-Coder: Egyptian Arabic Coding Assistant},
author={BLACK0X80 and HORUS-AI},
year={2025},
url={https://huggingface.co/BLACK0X80/horus-egy-coder},
note={Fine-tuned from meta-llama/Llama-3.1-8B-Instruct}
}
```
---
*Made with ❤️ in Egypt 🇪🇬*