AliMuhammad73 commited on
Commit
6a5c6c9
·
verified ·
1 Parent(s): a8959c3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +48 -1
README.md CHANGED
@@ -7,4 +7,51 @@ sdk: static
7
  pinned: false
8
  ---
9
 
10
- Edit this `README.md` markdown file to author your organization card.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
7
  pinned: false
8
  ---
9
 
10
+ # Orature AI - Pioneering Urdu Language AI
11
+
12
+ **Mission:** Orature AI is dedicated to advancing the frontiers of Artificial Intelligence, and Language Models for the Urdu language. We aim to develop computationally efficient, culturally-aware, and accessible language technologies that empower local communities, researchers, and businesses. Our work focuses on bridging the linguistic digital divide and promoting equitable and sustainable AI development.
13
+
14
+ **Vision:** To be a leading force in creating and democratizing state-of-the-art NLP resources for Urdu, a low-resource language, fostering innovation and inclusivity in the global AI landscape.
15
+
16
+ ## About Us
17
+
18
+ Orature AI has emerged from the foundational work of the ALIF الف project, a Final Year Project at Habib University (Spring 2025). Our core team comprises passionate researchers and engineers committed to open-source principles and collaborative innovation.
19
+
20
+ **Core Team (Founders of ALIF الف):**
21
+ * Syed Muhammad Ali Naqvi
22
+ * Zainab Haider
23
+ * Syeda Haya Fatima
24
+ * Hammad Sajid
25
+ * Ali Muhammad Asad
26
+
27
+ **Affiliation:**
28
+ * Habib University, Dhanani School of Science and Engineering
29
+
30
+ ## Our Focus Areas
31
+
32
+ * **Data Curation & Tokenization:** Novel creation and meticulous preprocessing of large-scale, culturally relevant datasets and language-specific tokenizer.
33
+ * **Urdu Language Model Development:** Creating robust pretrained and instruction-tuned Small Language Models (SLMs) for Urdu.
34
+ * **Low-Resource NLP:** Developing scalable frameworks and methodologies for building language models for underrepresented languages.
35
+ * **Open Source Contribution:** Sharing models, datasets, and research findings with the global community.
36
+ * **Sustainable AI:** Advocating for efficient and environmentally conscious AI practices.
37
+
38
+ ## Our Flagship Project: ALIF الف
39
+
40
+ The **ALIF الف** project represents our initial and core contribution, featuring a series of Urdu pretrained generative models, custom tokenizers, and comprehensive datasets.
41
+ <!-- * [Link to ALIF Project Paper/Website (if separate from HF)]
42
+ * [Link to ALIF Models on Hugging Face]
43
+ * [Link to ALIF Datasets on Hugging Face] -->
44
+
45
+ <!-- ## Values
46
+
47
+ * **Openness:** We believe in the power of open-source to accelerate research and development.
48
+ * **Inclusivity:** We strive to make AI accessible and beneficial for all linguistic communities.
49
+ * **Rigor:** We are committed to high-quality research and meticulous development practices.
50
+ * **Collaboration:** We welcome partnerships and contributions from the wider community.
51
+ * **Impact:** We aim to create AI solutions that have a tangible positive impact. -->
52
+
53
+ ## Get Involved
54
+
55
+ * **Explore our Models & Datasets:** Browse our contributions on the Hugging Face Hub.
56
+ <!-- * **Contribute:** We encourage contributions to our open-source projects. Check out our GitHub repositories [Link to Orature AI GitHub Org, if applicable]. -->
57
+ <!-- * **Contact Us:** For collaborations, inquiries, or feedback, please reach out to [YOUR_ORATURE_AI_EMAIL_OR_CONTACT_METHOD]. -->