update readme
- README.md +1 -1
- assets/moss_tts_family.jpeg +0 -3
README.md
CHANGED
@@ -10,7 +10,7 @@ MOSS‑TTS Family is an open‑source **speech and sound generation model family
 ## Introduction
 
 <p align="center">
-<img src="
+<img src="https://speech-demo.oss-cn-shanghai.aliyuncs.com/moss_tts_demo/tts_readme_imgaes_demo/moss_tts_family_arch.jpeg" width="85%" />
 </p>
 
 When a single piece of audio needs to **sound like a real person**, **pronounce every word accurately**, **switch speaking styles across content**, **remain stable over tens of minutes**, and **support dialogue, role‑play, and real‑time interaction**, a single TTS model is often not enough. The **MOSS‑TTS Family** breaks the workflow into five production‑ready models that can be used independently or composed into a complete pipeline.
assets/moss_tts_family.jpeg
DELETED