Update README.md
Browse files
README.md
CHANGED
|
@@ -14,12 +14,12 @@ language:
|
|
| 14 |
---
|
| 15 |
|
| 16 |
<p align="center">
|
| 17 |
-
<img src="https://
|
| 18 |
<b>Voila: <span style="color:#ca00f9">Voi</span>ce-<span style="color:#ca00f9">La</span>nguage Foundation Models</b><br/><br/>
|
| 19 |
-
💜 <a href="https://
|
| 20 |
</p>
|
| 21 |
|
| 22 |
-
Voila is a groundbreaking family of large audio-language foundation models that revolutionizes human-AI interactions. Breaking away from the constraints of traditional voice AI systems—high latency, loss of vocal nuances, and mechanical responses, Voila employs an innovative end-to-end model design and a novel hierarchical Transformer architecture. This approach enables real-time, autonomous, and rich voice interactions, with latency as low as 195 ms, surpassing average human response times. Combining advanced voice and language modeling, Voila offers customizable, persona-driven engagements and excels in a range of audio tasks from ASR and TTS to speech translation across six languages. With the online [web demo](https://
|
| 23 |
|
| 24 |
# ✨ Highlights
|
| 25 |
- ⭐ High-fidelity, low-latency, real-time streaming audio processing
|
|
@@ -28,12 +28,7 @@ Voila is a groundbreaking family of large audio-language foundation models that
|
|
| 28 |
- ⭐ Unified model for various audio tasks
|
| 29 |
|
| 30 |
# 🎥 Video Demo
|
| 31 |
-
|
| 32 |
-
<video width="60%" controls>
|
| 33 |
-
<source src="https://voila.maitrix.org/static/videos/voila-demo.mp4" type="video/mp4">
|
| 34 |
-
Your browser does not support the video tag.
|
| 35 |
-
</video>
|
| 36 |
-
</div>
|
| 37 |
|
| 38 |
# 🔥 Latest News!!
|
| 39 |
|
|
|
|
| 14 |
---
|
| 15 |
|
| 16 |
<p align="center">
|
| 17 |
+
<img src="https://maitrix-org.github.io/Voila-blog/static/images/logo.png" width="400"/><br/>
|
| 18 |
<b>Voila: <span style="color:#ca00f9">Voi</span>ce-<span style="color:#ca00f9">La</span>nguage Foundation Models</b><br/><br/>
|
| 19 |
+
💜 <a href="https://maitrix-org.github.io/Voila-blog"><b>Voila</b></a>    |    🖥️ <a href="https://github.com/maitrix-org/Voila">GitHub</a>    |   🤗 <a href="https://huggingface.co/collections/maitrix-org/voila-67e0d96962c19f221fc73fa5">Hugging Face</a>   |    📑 <a href="">Paper (Coming soon)</a>    |    🌐 <a href="https://huggingface.co/spaces/maitrix-org/Voila-demo">Demo</a>
|
| 20 |
</p>
|
| 21 |
|
| 22 |
+
Voila is a groundbreaking family of large audio-language foundation models that revolutionizes human-AI interactions. Breaking away from the constraints of traditional voice AI systems—high latency, loss of vocal nuances, and mechanical responses, Voila employs an innovative end-to-end model design and a novel hierarchical Transformer architecture. This approach enables real-time, autonomous, and rich voice interactions, with latency as low as 195 ms, surpassing average human response times. Combining advanced voice and language modeling, Voila offers customizable, persona-driven engagements and excels in a range of audio tasks from ASR and TTS to speech translation across six languages. With the online [web demo](https://huggingface.co/spaces/maitrix-org/Voila-demo), Voila invites you to explore a transformative, natural dialogue experience between human and AI.
|
| 23 |
|
| 24 |
# ✨ Highlights
|
| 25 |
- ⭐ High-fidelity, low-latency, real-time streaming audio processing
|
|
|
|
| 28 |
- ⭐ Unified model for various audio tasks
|
| 29 |
|
| 30 |
# 🎥 Video Demo
|
| 31 |
+
[](https://www.youtube.com/watch?v=J27M9-g5KL0)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 32 |
|
| 33 |
# 🔥 Latest News!!
|
| 34 |
|