File size: 7,011 Bytes
f45011a
 
 
 
 
 
 
4f89df1
 
f45011a
 
1d95c48
4f89df1
fa9f27d
4f89df1
 
 
 
 
 
 
 
 
 
 
 
 
63bcd50
4f89df1
 
 
 
 
 
 
63bcd50
4f89df1
 
 
63bcd50
 
 
 
 
 
 
4f89df1
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
f3c6ad0
4f89df1
954e003
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
---
title: README
emoji: 
colorFrom: pink
colorTo: yellow
sdk: static
pinned: false
thumbnail: >-
  https://cdn-uploads.huggingface.co/production/uploads/685d281ebd8c51629778c12c/nWsXyYbkwEvBfqh-81JbD.png
---

# [INFERENCE Lab](https://www.inference-lab.org)

**Applied AI Engineering & Research Lab**

Inference Lab is an applied AI research and engineering organization. We develop production-grade AI systems, construct high-quality datasets for underrepresented languages, and publish reproducible research across low-resource NLP, speech intelligence, and AI deployment. Our work is end-to-end: from raw data collection and annotation through model training, evaluation, and deployment as usable software.

---

## Research

### Low-Resource NLP

**Data-Centric Roman Urdu NLP: High-Quality Dataset Curation, Privacy-Preserving Embeddings, and State-of-the-Art Model Benchmarking**

A comprehensive data-centric study addressing the critical gaps in Roman Urdu NLP infrastructure. Covers rigorous dataset curation methodology, privacy-preserving embedding strategies, and systematic benchmarking of state-of-the-art models on Roman Urdu classification tasks. Establishes reproducible baselines for future work in this domain.

→ [Read here](https://doi.org/10.5281/zenodo.18080524)

---

**RUEmoCorp: A Large-Scale Roman Urdu Emotion Corpus with Cross-Institute Annotation Validation and State-of-the-Art Emotion Classification**

Construction and release of the largest Roman Urdu emotion recognition corpus to date. Introduces a cross-institute annotation validation framework with structured annotator roles, multi-round calibration, and Inter-Annotator Agreement (IAA) measurement. Accompanies the current state-of-the-art emotion classifier for Roman Urdu.

→ [Read here](https://doi.org/10.21203/rs.3.rs-9759243/v1)

---

**RUDaSA: Roman Urdu Dataset for Sentiment Analysis — A Large-Scale, Curated Corpus with Privacy-Preserving Embeddings and Competitive Benchmarking of Transformer Models**

RUDaSA is a large-scale Roman Urdu sentiment analysis benchmark that provides privacy-preserving embeddings and evaluates state-of-the-art transformer models to advance NLP research for low-resource and code-mixed languages.

→ [Read here](https://doi.org/10.21203/rs.3.rs-9827763/v1)


### Speech AI

**Modeling Vocal Fatigue as Embedding-Space Deviation Using Contrastively Trained ECAPA-TDNNs**

A novel approach to vocal fatigue detection that frames the problem as deviation measurement in speaker embedding space rather than direct classification. A contrastively trained ECAPA-TDNN encoder is used to capture speaker-specific vocal baselines; fatigue is quantified as geometric distance from the healthy reference embedding. Introduces the ECAPA-TDNN-VHE architecture, achieving 2.5× performance improvement over the standard ECAPA-TDNN baseline.

→ [Preprint](https://doi.org/10.5281/zenodo.18366305)

---

**Continuous Vocal Load Monitoring in Professional Voice Users: Development and Occupational Validation of an Automated Assessment System**

A complete occupational health monitoring system for professional voice users — teachers, call center operators, broadcasters, and clinical staff. Addresses the gap between laboratory vocal fatigue research and deployable real-world monitoring tools. Validated against occupational use conditions with a focus on practical deployment in professional environments.

→ *Under Review — Journal of Voice*

---

## Datasets

**RUEmoCorp** — Largest curated Roman Urdu Emotion Corpus

Multi-class emotion recognition corpus for Roman Urdu, constructed with structured annotation pipelines, cross-institute validation, and rigorous quality control. Supports research in low-resource affective computing and multilingual NLP.

[HuggingFace](#) · [Harvard Dataverse](https://doi.org/10.7910/DVN/BPWHOZ)

---

**Roman Urdu Sentiment Corpus** — Largest curated Roman Urdu Sentiment Corpus

Large-scale sentiment corpus for Roman Urdu, released with full documentation of collection methodology, annotation schema, and inter-annotator agreement statistics. Serves as the benchmark dataset for Roman Urdu sentiment classification.

[HuggingFace](https://huggingface.co/datasets/Khubaib01/RomanUrdu-NLP-Sentiment-Corpus) · [Harvard Dataverse](https://doi.org/10.7910/DVN/TMXDCL)

---

## Models

**ECAPA-TDNN-VHE** — Vocal Health Encoder

Custom ECAPA-TDNN architecture trained contrastively for vocal health assessment. Encodes speaker vocal characteristics into a health-sensitive embedding space. Achieves 2.5× performance improvement over the standard ECAPA-TDNN baseline on vocal fatigue detection benchmarks.

[HuggingFace](https://huggingface.co/Khubaib01/ECAPA-TDNN-VHE)

---

**Roman Urdu Emotion Classifier** — Current State of the Art

XLM-RoBERTa fine-tuned on RUEmoCorp for multi-class Roman Urdu emotion recognition. Macro F1: 0.9896. The highest-performing publicly available model for this task.

[HuggingFace](https://huggingface.co/Khubaib01/roman-urdu-emotion-xlmr-v2)

---

**Roman Urdu Sentiment Classifier** — Current State of the Art

XLM-RoBERTa fine-tuned on the Roman Urdu Sentiment Corpus. The highest-performing publicly available model for Roman Urdu sentiment classification.

[HuggingFace](https://huggingface.co/Khubaib01/roman-urdu-sentiment-xlmr)

---

## Software

**VoiceMonitor**
Python library for continuous vocal load monitoring. Designed for integration into occupational health workflows, real-time audio pipelines, and professional voice user monitoring systems.

**Auralis VFS**
Vocal fatigue scoring library. Provides a programmable interface for fatigue quantification using the ECAPA-TDNN-VHE encoder. Designed for clinical and occupational deployment scenarios.

**VocalID**
Voice biometrics library for speaker verification and identification. Built for security-sensitive applications requiring speaker authentication from raw audio.

**faker-pk**
Localized synthetic data generation library for Pakistan. Generates realistic dummy data — names, addresses, CNICs, phone numbers, and institutional identifiers — for database seeding, system testing, and privacy-safe development workflows.

---

## Standards

Every release from Inference Lab adheres to the following:

- Reproducible training and evaluation pipelines with public code
- Rigorous evaluation reporting — macro F1, per-class metrics, confidence intervals where applicable
- Documented data collection, annotation methodology, and IAA statistics
- Deployable inference code alongside model weights
- Honest documentation of limitations and failure cases
- Archival publication on Harvard Dataverse with permanent DOIs for all datasets

---

## Contact

**[Muhammad Khubaib Ahmad](https://www.linkedin.com/in/muhammad-khubaib-ahmad-)** — Founder, Lead Researcher & Engineer

[Gmail: inferencelab.ai@gmail.com](mailto:inferencelab.ai@gmail.com) 

[GitHub](https://github.com/Khubaib01)

Multan, Punjab, Pakistan