Spaces:

Steveeeeeeen
/

how-biased-is-whisper

Sleeping

App Files Files Community

Steveeeeeeen HF Staff commited on Jan 29

Commit

74703b5

verified ·

1 Parent(s): 2ba944a

Update constants.py

Browse files

Files changed (1) hide show

constants.py +30 -8

constants.py CHANGED Viewed

@@ -12,14 +12,6 @@ banner_url = "https://huggingface.co/datasets/reach-vb/random-images/resolve/mai
 BANNER = f'<div style="display: flex; justify-content: space-around;"><img src="{banner_url}" alt="Banner" style="width: 40vw; min-width: 300px; max-width: 600px;"> </div>'
 EXPLANATION = """
-    ## Why EdAcc Matters for ASR Evaluation
-    The EdAcc dataset is specifically designed to evaluate the robustness of Automatic Speech Recognition (ASR) models across diverse accents and demographics. This leaderboard helps you:
-    * **Assess Accent Fairness**: Compare model performance across 30+ different accents and speaker demographics
-    * **Evaluate Real-World Robustness**: Understand how ASR models perform beyond standard benchmarks
-    * **Make Informed Choices**: Select models that work well for your target demographics
     ### How to Read the Results
     * **Average WER ⬇️**: Lower Word Error Rate (WER) is better
     * **Average per Gender**: Average WER for each gender
@@ -29,6 +21,36 @@ EXPLANATION = """
     Use the column filter to focus on specific demographics or view all results together.
     """
 TITLE = "<html> <head> <style> h1 {text-align: center;} </style> </head> <body> <h1> 🤗 Open Automatic Speech Recognition Leaderboard </b> </body> </html>"
 INTRODUCTION_TEXT = "📐 Results on [EdAcc Dataset](https://huggingface.co/datasets/edinburghcstr/edacc) split by accents and gender. \

 BANNER = f'<div style="display: flex; justify-content: space-around;"><img src="{banner_url}" alt="Banner" style="width: 40vw; min-width: 300px; max-width: 600px;"> </div>'
 EXPLANATION = """
     ### How to Read the Results
     * **Average WER ⬇️**: Lower Word Error Rate (WER) is better
     * **Average per Gender**: Average WER for each gender
     Use the column filter to focus on specific demographics or view all results together.
     """
+EXPLANATION_EDACC = """
+    ## EdAcc: Evaluating ASR Models Across Global English Accents
+    The Edinburgh International Accents of English Corpus (EdAcc) features over 40 distinct English accents from both native (L1) and non-native (L2) speakers. This evaluation helps you:
+    * **Compare Gender Performance**: Analyze how models perform across male and female speakers
+    * **Evaluate Regional Robustness**: Test model accuracy across European, Asian, African, and American accents
+    * **Assess Real-World Applicability**: Understand performance in natural conversational settings
+    The results show that:
+    * Larger models consistently outperform their smaller counterparts
+    * Multilingual models often handle accent diversity better than English-only variants
+    * Distilled models maintain good performance but show slight degradation on challenging accents
+    """
+EXPLANATION_AFRI = """
+    ## AfriSpeech: Testing ASR Robustness on African English Accents
+    The AfriSpeech Out-of-Distribution (OOD) test set features 20 distinct African English accents not present in common training data. This benchmark:
+    * **Challenges Model Generalization**: Tests performance on truly underrepresented accents
+    * **Reveals Robustness Gaps**: Highlights limitations in current ASR systems
+    * **Guides Improvement**: Identifies areas needing focused development
+    Key findings show:
+    * Full-sized models significantly outperform distilled versions
+    * Multilingual models demonstrate better generalization to African accents
+    * Even top performers show room for improvement on these challenging accents
+    """
 TITLE = "<html> <head> <style> h1 {text-align: center;} </style> </head> <body> <h1> 🤗 Open Automatic Speech Recognition Leaderboard </b> </body> </html>"
 INTRODUCTION_TEXT = "📐 Results on [EdAcc Dataset](https://huggingface.co/datasets/edinburghcstr/edacc) split by accents and gender. \