Alif: Advancing Urdu Large Language Models via Multilingual Synthetic Data Distillation
Paper • 2510.09051 • Published
output = llm(
"Once upon a time,",
max_tokens=512,
echo=True
)
print(output)Alif 1.0 3B Instruct is an open-source instruction-tuned language model developed by traversaal.ai, focused on Urdu and English understanding and reasoning. Fine-tuned with high-quality, culturally aware synthetic data, Alif delivers strong multilingual performance with a particular emphasis on low-resource language alignment and nuanced instruction following.
unsloth/Llama-3.2-3BAlif-1.0-3B-InstructAlif is ideal for:
@article{ShafiqueAlif2025,
title = {Alif: Advancing Urdu Large Language Models via Multilingual Synthetic Data Distillation},
author = {Muhammad Ali Shafique and Kanwal Mehreen and Muhammad Arham and Maaz Amjad and Sabur Butt and Hamza Farooq},
journal = {arXiv preprint arXiv:2510.09051},
year = {2025},
url = {https://arxiv.org/abs/2510.09051}
}
4-bit
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="large-traversaal/Alif-1.0-3B-Instruct", filename="Alif-1.0-3B-Instruct-Q4_K_M.gguf", )