update readme
Browse files
README.md
CHANGED
|
@@ -36,6 +36,8 @@ When a single piece of audio needs to **sound like a real person**, **pronounce
|
|
| 36 |
|
| 37 |
|
| 38 |
# MOSS-TTS
|
|
|
|
|
|
|
| 39 |
## 1. Overview
|
| 40 |
### 1.1 TTS Family Positioning
|
| 41 |
MOSS-TTS is the **flagship base model** in our open-source **TTS Family**. It is designed as a production-ready synthesis backbone that can serve as the primary high-quality engine for scalable voice applications, and as a strong research baseline for controllable TTS and discrete audio token modeling.
|
|
|
|
| 36 |
|
| 37 |
|
| 38 |
# MOSS-TTS
|
| 39 |
+
**MOSS-TTS** is a next-generation, production-grade TTS foundation model focused on **voice cloning**, **ultra-long stable speech generation**, **token-level duration control**, **multilingual & code-switched synthesis**, and **fine-grained Pinyin/phoneme-level pronunciation control**. It is built on a clean autoregressive discrete-token recipe that emphasizes high-quality audio tokenization, large-scale diverse pre-training data, and efficient discrete token modeling.
|
| 40 |
+
|
| 41 |
## 1. Overview
|
| 42 |
### 1.1 TTS Family Positioning
|
| 43 |
MOSS-TTS is the **flagship base model** in our open-source **TTS Family**. It is designed as a production-ready synthesis backbone that can serve as the primary high-quality engine for scalable voice applications, and as a strong research baseline for controllable TTS and discrete audio token modeling.
|