JASTIN: Aligning LLMs for Zero-Shot Audio and Speech Evaluation via Natural Language Instructions Paper • 2605.04505 • Published 8 days ago • 2
CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching Paper • 2506.00885 • Published Jun 1, 2025 • 1
view article Article TTS Arena: Benchmarking Text-to-Speech Models in the Wild +5 mrfakename, reach-vb, clefourrier, Wauplin, ylacombe, main-horse, sanchit-gandhi • Feb 27, 2024 • 72
espnet/Wangyou_Zhang_universal_train_enh_uses_refch0_2mem_raw Audio-to-Audio • Updated Jun 1, 2025 • 4 • 2