Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities Paper • 2503.04721 • Published Mar 6, 2025 • 2
Moshi: a speech-text foundation model for real-time dialogue Paper • 2410.00037 • Published Sep 17, 2024 • 11
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing Paper • 2110.13900 • Published Oct 26, 2021 • 1