AImpower
/

StutteredSpeechASR

@@ -23,23 +23,27 @@ The model was fine-tuned on the **AS-70: A Mandarin stuttered speech dataset** f
 ### Authors of the Dataset & Paper
-Rong Gong, Hongfei Xue, Lezhi Wang, Xin Xu, Qisheng Li, Lei Xie, Hui Bu, Shaomei Wu, Jiaming Zhou, Yong Qin, Binbin Zhang, Jun Du, Jia Bin, Ming Li
 ## Intended Uses
-- Transcribing Mandarin Chinese audio, particularly for speakers who stutter.
-- Research in speech therapy, clinical linguistics, or accessibility applications.
 ### Out-of-Scope Use
 - Non-Chinese languages or highly noisy audio.
 - Real-time transcription without optimization.
-- Sensitive or legal audio without human verification.
 ## Limitations & Risks
 - Accuracy may drop on fast speech, mixed-language speech, or heavy background noise.
-- Stuttering patterns may still cause transcription errors.
 - Not recommended to use as sole source for clinical or legal decisions.
 ## How to Use
@@ -81,4 +85,4 @@ model.to(device)
 ## Citation
 **Paper:**
-Gong, Rong, et al. "As-70: A mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection." arXiv preprint arXiv:2406.07256 (2024).

 ### Authors of the Dataset & Paper
+- Dataset: Rong Gong, Hongfei Xue, Lezhi Wang, Xin Xu, Qisheng Li, Lei Xie, Hui Bu, Shaomei Wu, Jiaming Zhou, Yong Qin, Binbin Zhang, Jun Du, Jia Bin, Ming Li
+- Dataset paper: Gong, R., Xue, H., Wang, L., Xu, X., Li, Q., Xie, L., Bu, H., Wu, S., Zhou, J., Qin, Y., Zhang, B., Du, J., Bin, J., Li, M. (2024) AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection. Proc. Interspeech 2024, 5098-5102, doi: 10.21437/Interspeech.2024-918
+- Fine-tuning paper: Jingjin Li, Qisheng Li, Rong Gong, Lezhi Wang, and Shaomei Wu. 2025. Our Collective Voices: The Social and Technical Values of a Grassroots Chinese Stuttered Speech Dataset. In Proceedings of the 2025 ACM Conference on Fairness, Accountability, and Transparency (FAccT '25). Association for Computing Machinery, New York, NY, USA, 2768–2783. https://doi.org/10.1145/3715275.3732179
 ## Intended Uses
+- Transcribing Mandarin Chinese spoken language verbatim, particularly for speakers who stutter.
+- Research in stuttering affirming speech therapy, clinical linguistics, or accessibility applications.
 ### Out-of-Scope Use
 - Non-Chinese languages or highly noisy audio.
 - Real-time transcription without optimization.
+- Sensitive or legal audio without human verification.
+- Other use cases that undermine the dignity and quality of life of people who stutter.
 ## Limitations & Risks
 - Accuracy may drop on fast speech, mixed-language speech, or heavy background noise.
+- Stuttering is highly variable and heterogenous, certain stuttering patterns may still result in high transcription errors.
 - Not recommended to use as sole source for clinical or legal decisions.
 ## How to Use
 ## Citation
 **Paper:**
+Jingjin Li, Qisheng Li, Rong Gong, Lezhi Wang, and Shaomei Wu. 2025. Our Collective Voices: The Social and Technical Values of a Grassroots Chinese Stuttered Speech Dataset. In Proceedings of the 2025 ACM Conference on Fairness, Accountability, and Transparency (FAccT '25). Association for Computing Machinery, New York, NY, USA, 2768–2783. https://doi.org/10.1145/3715275.3732179