README / README.md
Approximetal's picture
Update README.md
8ef7026 verified
|
raw
history blame
1.01 kB
metadata
title: README
emoji: 📈
colorFrom: indigo
colorTo: red
sdk: static
pinned: false

LEMAS: A 150K-Hour Large-scale Extensible Multilingual Audio Suite with Generative Speech Models

LEMAS is a large-scale extensible multilingual audio suite, providing the largest open-source multilingual speech corpus with word-level timestamps to our knowledge, covering over 150,000 hours across 10 major languages. Built with a rigorous alignment and confidence-based filtering pipeline, LEMAS supports diverse generative paradigms including zero-shot multilingual synthesis (LEMAS-TTS) and seamless speech editing (LEMAS-Edit).

Citation

@article{zhao2026lemas, title={LEMAS: A 150K-Hour Large-scale Extensible Multilingual Audio Suite with Generative Speech Models}, author={Zhao, Zhiyuan and Lin, Lijian and Zhu, Ye and Xie, Kai and Liu, Yunfei and Li, Yu}, journal={arXiv preprint arXiv:2601.04233}, year={2026} }