Spaces:
Running
Running
File size: 7,011 Bytes
f45011a 4f89df1 f45011a 1d95c48 4f89df1 fa9f27d 4f89df1 63bcd50 4f89df1 63bcd50 4f89df1 63bcd50 4f89df1 f3c6ad0 4f89df1 954e003 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 | ---
title: README
emoji: ⚡
colorFrom: pink
colorTo: yellow
sdk: static
pinned: false
thumbnail: >-
https://cdn-uploads.huggingface.co/production/uploads/685d281ebd8c51629778c12c/nWsXyYbkwEvBfqh-81JbD.png
---
# [INFERENCE Lab](https://www.inference-lab.org)
**Applied AI Engineering & Research Lab**
Inference Lab is an applied AI research and engineering organization. We develop production-grade AI systems, construct high-quality datasets for underrepresented languages, and publish reproducible research across low-resource NLP, speech intelligence, and AI deployment. Our work is end-to-end: from raw data collection and annotation through model training, evaluation, and deployment as usable software.
---
## Research
### Low-Resource NLP
**Data-Centric Roman Urdu NLP: High-Quality Dataset Curation, Privacy-Preserving Embeddings, and State-of-the-Art Model Benchmarking**
A comprehensive data-centric study addressing the critical gaps in Roman Urdu NLP infrastructure. Covers rigorous dataset curation methodology, privacy-preserving embedding strategies, and systematic benchmarking of state-of-the-art models on Roman Urdu classification tasks. Establishes reproducible baselines for future work in this domain.
→ [Read here](https://doi.org/10.5281/zenodo.18080524)
---
**RUEmoCorp: A Large-Scale Roman Urdu Emotion Corpus with Cross-Institute Annotation Validation and State-of-the-Art Emotion Classification**
Construction and release of the largest Roman Urdu emotion recognition corpus to date. Introduces a cross-institute annotation validation framework with structured annotator roles, multi-round calibration, and Inter-Annotator Agreement (IAA) measurement. Accompanies the current state-of-the-art emotion classifier for Roman Urdu.
→ [Read here](https://doi.org/10.21203/rs.3.rs-9759243/v1)
---
**RUDaSA: Roman Urdu Dataset for Sentiment Analysis — A Large-Scale, Curated Corpus with Privacy-Preserving Embeddings and Competitive Benchmarking of Transformer Models**
RUDaSA is a large-scale Roman Urdu sentiment analysis benchmark that provides privacy-preserving embeddings and evaluates state-of-the-art transformer models to advance NLP research for low-resource and code-mixed languages.
→ [Read here](https://doi.org/10.21203/rs.3.rs-9827763/v1)
### Speech AI
**Modeling Vocal Fatigue as Embedding-Space Deviation Using Contrastively Trained ECAPA-TDNNs**
A novel approach to vocal fatigue detection that frames the problem as deviation measurement in speaker embedding space rather than direct classification. A contrastively trained ECAPA-TDNN encoder is used to capture speaker-specific vocal baselines; fatigue is quantified as geometric distance from the healthy reference embedding. Introduces the ECAPA-TDNN-VHE architecture, achieving 2.5× performance improvement over the standard ECAPA-TDNN baseline.
→ [Preprint](https://doi.org/10.5281/zenodo.18366305)
---
**Continuous Vocal Load Monitoring in Professional Voice Users: Development and Occupational Validation of an Automated Assessment System**
A complete occupational health monitoring system for professional voice users — teachers, call center operators, broadcasters, and clinical staff. Addresses the gap between laboratory vocal fatigue research and deployable real-world monitoring tools. Validated against occupational use conditions with a focus on practical deployment in professional environments.
→ *Under Review — Journal of Voice*
---
## Datasets
**RUEmoCorp** — Largest curated Roman Urdu Emotion Corpus
Multi-class emotion recognition corpus for Roman Urdu, constructed with structured annotation pipelines, cross-institute validation, and rigorous quality control. Supports research in low-resource affective computing and multilingual NLP.
[HuggingFace](#) · [Harvard Dataverse](https://doi.org/10.7910/DVN/BPWHOZ)
---
**Roman Urdu Sentiment Corpus** — Largest curated Roman Urdu Sentiment Corpus
Large-scale sentiment corpus for Roman Urdu, released with full documentation of collection methodology, annotation schema, and inter-annotator agreement statistics. Serves as the benchmark dataset for Roman Urdu sentiment classification.
[HuggingFace](https://huggingface.co/datasets/Khubaib01/RomanUrdu-NLP-Sentiment-Corpus) · [Harvard Dataverse](https://doi.org/10.7910/DVN/TMXDCL)
---
## Models
**ECAPA-TDNN-VHE** — Vocal Health Encoder
Custom ECAPA-TDNN architecture trained contrastively for vocal health assessment. Encodes speaker vocal characteristics into a health-sensitive embedding space. Achieves 2.5× performance improvement over the standard ECAPA-TDNN baseline on vocal fatigue detection benchmarks.
[HuggingFace](https://huggingface.co/Khubaib01/ECAPA-TDNN-VHE)
---
**Roman Urdu Emotion Classifier** — Current State of the Art
XLM-RoBERTa fine-tuned on RUEmoCorp for multi-class Roman Urdu emotion recognition. Macro F1: 0.9896. The highest-performing publicly available model for this task.
[HuggingFace](https://huggingface.co/Khubaib01/roman-urdu-emotion-xlmr-v2)
---
**Roman Urdu Sentiment Classifier** — Current State of the Art
XLM-RoBERTa fine-tuned on the Roman Urdu Sentiment Corpus. The highest-performing publicly available model for Roman Urdu sentiment classification.
[HuggingFace](https://huggingface.co/Khubaib01/roman-urdu-sentiment-xlmr)
---
## Software
**VoiceMonitor**
Python library for continuous vocal load monitoring. Designed for integration into occupational health workflows, real-time audio pipelines, and professional voice user monitoring systems.
**Auralis VFS**
Vocal fatigue scoring library. Provides a programmable interface for fatigue quantification using the ECAPA-TDNN-VHE encoder. Designed for clinical and occupational deployment scenarios.
**VocalID**
Voice biometrics library for speaker verification and identification. Built for security-sensitive applications requiring speaker authentication from raw audio.
**faker-pk**
Localized synthetic data generation library for Pakistan. Generates realistic dummy data — names, addresses, CNICs, phone numbers, and institutional identifiers — for database seeding, system testing, and privacy-safe development workflows.
---
## Standards
Every release from Inference Lab adheres to the following:
- Reproducible training and evaluation pipelines with public code
- Rigorous evaluation reporting — macro F1, per-class metrics, confidence intervals where applicable
- Documented data collection, annotation methodology, and IAA statistics
- Deployable inference code alongside model weights
- Honest documentation of limitations and failure cases
- Archival publication on Harvard Dataverse with permanent DOIs for all datasets
---
## Contact
**[Muhammad Khubaib Ahmad](https://www.linkedin.com/in/muhammad-khubaib-ahmad-)** — Founder, Lead Researcher & Engineer
[Gmail: inferencelab.ai@gmail.com](mailto:inferencelab.ai@gmail.com)
[GitHub](https://github.com/Khubaib01)
Multan, Punjab, Pakistan |