update README only
Browse files
README.md
CHANGED
|
@@ -258,8 +258,11 @@ We introduce a robust evaluation framework leveraging **MMS-FA** for alignment a
|
|
| 258 |
|
| 259 |
### Subjective Evaluation
|
| 260 |
For open-source models, annotators are asked to score each sample pair in terms of speaker attribution accuracy, voice similarity, prosody, and overall quality. Following the methodology of the LMSYS Chatbot Arena, we compute Elo ratings and confidence intervals for each dimension.
|
| 261 |
-
|
|
|
|
|
|
|
| 262 |
|
| 263 |
For closed-source models, annotators are only asked to choose the overall preferred one in each pair, and we compute the win rate accordingly.
|
| 264 |
-
|
| 265 |
-
|
|
|
|
|
|
| 258 |
|
| 259 |
### Subjective Evaluation
|
| 260 |
For open-source models, annotators are asked to score each sample pair in terms of speaker attribution accuracy, voice similarity, prosody, and overall quality. Following the methodology of the LMSYS Chatbot Arena, we compute Elo ratings and confidence intervals for each dimension.
|
| 261 |
+
<p align="center">
|
| 262 |
+
<img src="https://speech-demo.oss-cn-shanghai.aliyuncs.com/moss_tts_demo/tts_readme_imgaes_demo/moss_ttsd_subjective_evaluation" width="85%" />
|
| 263 |
+
</p>
|
| 264 |
|
| 265 |
For closed-source models, annotators are only asked to choose the overall preferred one in each pair, and we compute the win rate accordingly.
|
| 266 |
+
<p align="center">
|
| 267 |
+
<img src="https://speech-demo.oss-cn-shanghai.aliyuncs.com/moss_tts_demo/tts_readme_imgaes_demo/moss_ttsd_winrate" width="85%" />
|
| 268 |
+
</p>
|