Spoken Stereoset: On Evaluating Social Bias Toward Speaker in Speech Large Language Models Paper • 2408.07665 • Published Aug 14, 2024
EMO-Debias: Benchmarking Gender Debiasing Techniques in Multi-Label Speech Emotion Recognition Paper • 2506.04652 • Published Jun 5 • 1
Fake-Mamba: Real-Time Speech Deepfake Detection Using Bidirectional Mamba as Self-Attention's Alternative Paper • 2508.09294 • Published Aug 12
Meta-PerSER: Few-Shot Listener Personalized Speech Emotion Recognition via Meta-learning Paper • 2505.16220 • Published May 22 • 1
Do You Hear What I Mean? Quantifying the Instruction-Perception Gap in Instruction-Guided Expressive Text-To-Speech Systems Paper • 2509.13989 • Published Sep 17 • 3
CO-VADA: A Confidence-Oriented Voice Augmentation Debiasing Approach for Fair Speech Emotion Recognition Paper • 2506.06071 • Published Jun 6 • 1
MI-Fuse: Label Fusion for Unsupervised Domain Adaptation with Closed-Source Large-Audio Language Model Paper • 2509.20706 • Published Sep 25 • 2
Codec-SUPERB: An In-Depth Analysis of Sound Codec Models Paper • 2402.13071 • Published Feb 20, 2024 • 1