Text-to-Speech
ONNX
zero-shot
multilingual
Approximetal commited on
Commit
21ef83d
·
verified ·
1 Parent(s): 7b632eb

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +53 -0
README.md ADDED
@@ -0,0 +1,53 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ datasets:
3
+ - LEMAS-Project/LEMAS-Dataset-train
4
+ - LEMAS-Project/LEMAS-Dataset-eval
5
+ language:
6
+ - it
7
+ - pt
8
+ - es
9
+ - fr
10
+ - de
11
+ - en
12
+ - zh
13
+ license: cc-by-nc-4.0
14
+ pipeline_tag: text-to-speech
15
+ tags:
16
+ - zero-shot
17
+ - multilingual
18
+ ---
19
+
20
+ # LEMAS-Edit
21
+
22
+ LEMAS-Edit is a multilingual zero-shot speech editing system, presented in the paper [LEMAS: A 150K-Hour Large-scale Extensible Multilingual Audio Suite with Generative Speech Models](https://huggingface.co/papers/2601.04233).
23
+
24
+ - **Project Page:** [https://lemas-project.github.io/LEMAS-Project](https://lemas-project.github.io/LEMAS-Project)
25
+ - **Paper:** [https://arxiv.org/abs/2601.04233](https://arxiv.org/abs/2601.04233)
26
+ - **GitHub Repository:** [https://github.com/LEMAS-Project/LEMAS-Edit](https://github.com/LEMAS-Project/LEMAS-Edit)
27
+ - **Hugging Face Demo:** [https://huggingface.co/spaces/LEMAS-Project/LEMAS-Edit](https://huggingface.co/spaces/LEMAS-Project/LEMAS-Edit)
28
+
29
+ ## Supported Languages
30
+
31
+ The model supports 7 major languages for zero-shot synthesis:
32
+ - Chinese (zh)
33
+ - English (en)
34
+ - Spanish (es)
35
+ - French (fr)
36
+ - German (de)
37
+ - Italian (it)
38
+ - Portuguese (pt)
39
+
40
+ ## Training Data
41
+
42
+ LEMAS-Edit was trained on the subset of [LEMAS-Dataset](https://huggingface.co/datasets/LEMAS-Project/LEMAS-Dataset-train), which is, to our knowledge, currently the largest open-source multilingual speech corpus with word-level timestamps. It covers over 150,000 hours across 10 major languages.
43
+
44
+ ## Citation
45
+
46
+ ```bibtex
47
+ @article{zhao2026lemas,
48
+ title={LEMAS: A 150K-Hour Large-scale Extensible Multilingual Audio Suite with Generative Speech Models},
49
+ author={Zhao, Zhiyuan and Lin, Lijian and Zhu, Ye and Xie, Kai and Liu, Yunfei and Li, Yu},
50
+ journal={arXiv preprint arXiv:2601.04233},
51
+ year={2026}
52
+ }
53
+ ```