Khubaib01 commited on
Commit
63bcd50
·
verified ·
1 Parent(s): 954e003

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -2
README.md CHANGED
@@ -25,7 +25,7 @@ Inference Lab is an applied AI research and engineering organization. We develop
25
 
26
  A comprehensive data-centric study addressing the critical gaps in Roman Urdu NLP infrastructure. Covers rigorous dataset curation methodology, privacy-preserving embedding strategies, and systematic benchmarking of state-of-the-art models on Roman Urdu classification tasks. Establishes reproducible baselines for future work in this domain.
27
 
28
- → [Preprint](https://doi.org/10.5281/zenodo.18080524)
29
 
30
  ---
31
 
@@ -33,10 +33,17 @@ A comprehensive data-centric study addressing the critical gaps in Roman Urdu NL
33
 
34
  Construction and release of the largest Roman Urdu emotion recognition corpus to date. Introduces a cross-institute annotation validation framework with structured annotator roles, multi-round calibration, and Inter-Annotator Agreement (IAA) measurement. Accompanies the current state-of-the-art emotion classifier for Roman Urdu.
35
 
36
- *Under Progress*
37
 
38
  ---
39
 
 
 
 
 
 
 
 
40
  ### Speech AI
41
 
42
  **Modeling Vocal Fatigue as Embedding-Space Deviation Using Contrastively Trained ECAPA-TDNNs**
 
25
 
26
  A comprehensive data-centric study addressing the critical gaps in Roman Urdu NLP infrastructure. Covers rigorous dataset curation methodology, privacy-preserving embedding strategies, and systematic benchmarking of state-of-the-art models on Roman Urdu classification tasks. Establishes reproducible baselines for future work in this domain.
27
 
28
+ → [Read here](https://doi.org/10.5281/zenodo.18080524)
29
 
30
  ---
31
 
 
33
 
34
  Construction and release of the largest Roman Urdu emotion recognition corpus to date. Introduces a cross-institute annotation validation framework with structured annotator roles, multi-round calibration, and Inter-Annotator Agreement (IAA) measurement. Accompanies the current state-of-the-art emotion classifier for Roman Urdu.
35
 
36
+ [Read here](https://doi.org/10.21203/rs.3.rs-9759243/v1)
37
 
38
  ---
39
 
40
+ **RUDaSA: Roman Urdu Dataset for Sentiment Analysis — A Large-Scale, Curated Corpus with Privacy-Preserving Embeddings and Competitive Benchmarking of Transformer Models**
41
+
42
+ RUDaSA is a large-scale Roman Urdu sentiment analysis benchmark that provides privacy-preserving embeddings and evaluates state-of-the-art transformer models to advance NLP research for low-resource and code-mixed languages.
43
+
44
+ → [Read here](https://doi.org/10.21203/rs.3.rs-9827763/v1)
45
+
46
+
47
  ### Speech AI
48
 
49
  **Modeling Vocal Fatigue as Embedding-Space Deviation Using Contrastively Trained ECAPA-TDNNs**