Update README.md
README.md
@@ -7,4 +7,51 @@ sdk: static
 pinned: false
 ---
 
-
+# Orature AI - Pioneering Urdu Language AI
+
+**Mission:** Orature AI is dedicated to advancing the frontiers of artificial intelligence and language models for the Urdu language. We aim to develop computationally efficient, culturally aware, and accessible language technologies that empower local communities, researchers, and businesses. Our work focuses on bridging the linguistic digital divide and promoting equitable and sustainable AI development.
+
+**Vision:** To be a leading force in creating and democratizing state-of-the-art NLP resources for Urdu, a low-resource language, fostering innovation and inclusivity in the global AI landscape.
+
+## About Us
+
+Orature AI has emerged from the foundational work of the ALIF الف project, a Final Year Project at Habib University (Spring 2025). Our core team comprises passionate researchers and engineers committed to open-source principles and collaborative innovation.
+
+**Core Team (Founders of ALIF الف):**
+* Syed Muhammad Ali Naqvi
+* Zainab Haider
+* Syeda Haya Fatima
+* Hammad Sajid
+* Ali Muhammad Asad
+
+**Affiliation:**
+* Habib University, Dhanani School of Science and Engineering
+
+## Our Focus Areas
+
+* **Data Curation & Tokenization:** Creating and meticulously preprocessing large-scale, culturally relevant datasets, and building language-specific tokenizers.
+* **Urdu Language Model Development:** Creating robust pretrained and instruction-tuned Small Language Models (SLMs) for Urdu.
+* **Low-Resource NLP:** Developing scalable frameworks and methodologies for building language models for underrepresented languages.
+* **Open Source Contribution:** Sharing models, datasets, and research findings with the global community.
+* **Sustainable AI:** Advocating for efficient and environmentally conscious AI practices.
+
+## Our Flagship Project: ALIF الف
+
+The **ALIF الف** project represents our initial and core contribution, featuring a series of Urdu pretrained generative models, custom tokenizers, and comprehensive datasets.
+<!-- * [Link to ALIF Project Paper/Website (if separate from HF)]
+* [Link to ALIF Models on Hugging Face]
+* [Link to ALIF Datasets on Hugging Face] -->
+
+<!-- ## Values
+
+* **Openness:** We believe in the power of open-source to accelerate research and development.
+* **Inclusivity:** We strive to make AI accessible and beneficial for all linguistic communities.
+* **Rigor:** We are committed to high-quality research and meticulous development practices.
+* **Collaboration:** We welcome partnerships and contributions from the wider community.
+* **Impact:** We aim to create AI solutions that have a tangible positive impact. -->
+
+## Get Involved
+
+* **Explore our Models & Datasets:** Browse our contributions on the Hugging Face Hub.
+<!-- * **Contribute:** We encourage contributions to our open-source projects. Check out our GitHub repositories [Link to Orature AI GitHub Org, if applicable]. -->
+<!-- * **Contact Us:** For collaborations, inquiries, or feedback, please reach out to [YOUR_ORATURE_AI_EMAIL_OR_CONTACT_METHOD]. -->