Spaces:
Running
Running
metadata
title: README
emoji: 📈
colorFrom: indigo
colorTo: red
sdk: static
pinned: false
LEMAS: A 150K-Hour Large-scale Extensible Multilingual Audio Suite with Generative Speech Models
LEMAS is a large-scale extensible multilingual audio suite, providing the largest open-source multilingual speech corpus with word-level timestamps to our knowledge, covering over 150,000 hours across 10 major languages. Built with a rigorous alignment and confidence-based filtering pipeline, LEMAS supports diverse generative paradigms including zero-shot multilingual synthesis (LEMAS-TTS) and seamless speech editing (LEMAS-Edit).
Citation
@article{zhao2026lemas, title={LEMAS: A 150K-Hour Large-scale Extensible Multilingual Audio Suite with Generative Speech Models}, author={Zhao, Zhiyuan and Lin, Lijian and Zhu, Ye and Xie, Kai and Liu, Yunfei and Li, Yu}, journal={arXiv preprint arXiv:2601.04233}, year={2026} }