Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits Paper • 2505.14648 • Published May 20, 2025 • 10
tiantiaf/whisper-large-v3-msp-podcast-emotion-dim Audio Classification • 2B • Updated Aug 10, 2025 • 7.42k • 3
tiantiaf/whisper-large-v3-msp-podcast-emotion Audio Classification • 2B • Updated Aug 10, 2025 • 7.21k • 5
Vox-Profile Collection This collection includes the implementation of models described in the Vox-Profile benchmark. (https://arxiv.org/pdf/2505.14648). • 14 items • Updated 23 days ago • 3
Voxlect - Whisper-Large-v3 Collection A Speech Foundation Model Benchmark for Classifying Dialects and Regional Languages around the Globe - Whisper-Large-v3 Family • 10 items • Updated Jan 27 • 2
Voxlect - MMS-LID-256 Collection A Speech Foundation Model Benchmark for Classifying Dialects and Regional Languages across the Globe - MMS-LID-256 Family • 10 items • Updated Aug 5, 2025 • 2
Voxlect - Whisper-Small Collection A Speech Foundation Model Benchmark for Classifying Dialects and Regional Languages around the Globe - Whisper-Small Family • 10 items • Updated Aug 9, 2025 • 2
Rethinking Training Targets, Architectures and Data Quality for Universal Speech Enhancement Paper • 2603.02641 • Published Mar 3 • 7