Spaces:
Running
Running
Update templates/index.html
Browse files- templates/index.html +10 -10
templates/index.html
CHANGED
|
@@ -75,12 +75,11 @@
|
|
| 75 |
|
| 76 |
<div id="evaluation-info" class="mb-5">
|
| 77 |
<p class="text-start">
|
| 78 |
-
<span class="section-title"><i class="fas fa-info-circle"></i>Welcome
|
| 79 |
-
<strong>Welcome to the Speech-to-Speech (S2S) Model Evaluation! 🎤</strong>
|
| 80 |
<br><br>
|
| 81 |
In this evaluation, you will assess the performance of 4 S2S models:
|
| 82 |
-
<strong>ChatGPT-4o</strong
|
| 83 |
-
<strong>Mini-Omni</strong
|
| 84 |
The goal is to evaluate how well these models handle various speech tasks across different domains.
|
| 85 |
<br><br>
|
| 86 |
<span class="section-title"><i class="fas fa-tasks"></i>How It Works</span>
|
|
@@ -89,8 +88,9 @@
|
|
| 89 |
For example:
|
| 90 |
<br><br>
|
| 91 |
|
| 92 |
-
<strong>Audio
|
| 93 |
-
|
|
|
|
| 94 |
<source src="/static/audio/sample/input_audio.wav" type="audio/wav">
|
| 95 |
</audio>
|
| 96 |
|
|
@@ -108,7 +108,7 @@
|
|
| 108 |
|
| 109 |
<!-- ChatGPT-4o Output -->
|
| 110 |
<span><strong>ChatGPT-4o:</strong></span>
|
| 111 |
-
<audio controls>
|
| 112 |
<source src="/static/audio/sample/4o_audio.wav" type="audio/wav">
|
| 113 |
</audio>
|
| 114 |
<p class="text-start" style="margin-left: 20px;">
|
|
@@ -117,7 +117,7 @@
|
|
| 117 |
|
| 118 |
<!-- FunAudioLLM Output -->
|
| 119 |
<span><strong>FunAudioLLM:</strong></span>
|
| 120 |
-
<audio controls>
|
| 121 |
<source src="/static/audio/sample/FunAudio_audio.wav" type="audio/wav">
|
| 122 |
</audio>
|
| 123 |
<p class="text-start" style="margin-left: 20px;">
|
|
@@ -126,7 +126,7 @@
|
|
| 126 |
|
| 127 |
<!-- SpeechGPT Output -->
|
| 128 |
<span><strong>SpeechGPT:</strong></span>
|
| 129 |
-
<audio controls>
|
| 130 |
<source src="/static/audio/sample/SpeechGPT.wav" type="audio/wav">
|
| 131 |
</audio>
|
| 132 |
<p class="text-start" style="margin-left: 20px;">
|
|
@@ -135,7 +135,7 @@
|
|
| 135 |
|
| 136 |
<!-- Mini-Omni Output -->
|
| 137 |
<span><strong>Mini-Omni:</strong></span>
|
| 138 |
-
<audio controls>
|
| 139 |
<source src="/static/audio/sample/mini-omni.wav" type="audio/wav">
|
| 140 |
</audio>
|
| 141 |
<p class="text-start" style="margin-left: 20px;">
|
|
|
|
| 75 |
|
| 76 |
<div id="evaluation-info" class="mb-5">
|
| 77 |
<p class="text-start">
|
| 78 |
+
<span class="section-title"><i class="fas fa-info-circle"></i>Welcome to the Speech-to-Speech (S2S) Model Evaluation! 🎤</span>
|
|
|
|
| 79 |
<br><br>
|
| 80 |
In this evaluation, you will assess the performance of 4 S2S models:
|
| 81 |
+
<strong>ChatGPT-4o</strong>, <strong>FunAudioLLM</strong>, <strong>SpeechGPT</strong>, and
|
| 82 |
+
<strong>Mini-Omni</strong>.
|
| 83 |
The goal is to evaluate how well these models handle various speech tasks across different domains.
|
| 84 |
<br><br>
|
| 85 |
<span class="section-title"><i class="fas fa-tasks"></i>How It Works</span>
|
|
|
|
| 88 |
For example:
|
| 89 |
<br><br>
|
| 90 |
|
| 91 |
+
<span style="vertical-align: middle; line-height: 1.2; display: inline-block;"><strong>Audio
|
| 92 |
+
Sample:</strong></span>
|
| 93 |
+
<audio controls style="vertical-align: middle;">
|
| 94 |
<source src="/static/audio/sample/input_audio.wav" type="audio/wav">
|
| 95 |
</audio>
|
| 96 |
|
|
|
|
| 108 |
|
| 109 |
<!-- ChatGPT-4o Output -->
|
| 110 |
<span><strong>ChatGPT-4o:</strong></span>
|
| 111 |
+
<audio controls style="vertical-align: middle;">
|
| 112 |
<source src="/static/audio/sample/4o_audio.wav" type="audio/wav">
|
| 113 |
</audio>
|
| 114 |
<p class="text-start" style="margin-left: 20px;">
|
|
|
|
| 117 |
|
| 118 |
<!-- FunAudioLLM Output -->
|
| 119 |
<span><strong>FunAudioLLM:</strong></span>
|
| 120 |
+
<audio controls style="vertical-align: middle;">
|
| 121 |
<source src="/static/audio/sample/FunAudio_audio.wav" type="audio/wav">
|
| 122 |
</audio>
|
| 123 |
<p class="text-start" style="margin-left: 20px;">
|
|
|
|
| 126 |
|
| 127 |
<!-- SpeechGPT Output -->
|
| 128 |
<span><strong>SpeechGPT:</strong></span>
|
| 129 |
+
<audio controls style="vertical-align: middle;">
|
| 130 |
<source src="/static/audio/sample/SpeechGPT.wav" type="audio/wav">
|
| 131 |
</audio>
|
| 132 |
<p class="text-start" style="margin-left: 20px;">
|
|
|
|
| 135 |
|
| 136 |
<!-- Mini-Omni Output -->
|
| 137 |
<span><strong>Mini-Omni:</strong></span>
|
| 138 |
+
<audio controls style="vertical-align: middle;">
|
| 139 |
<source src="/static/audio/sample/mini-omni.wav" type="audio/wav">
|
| 140 |
</audio>
|
| 141 |
<p class="text-start" style="margin-left: 20px;">
|