Update README.md
README.md
CHANGED
@@ -18,6 +18,11 @@ OmniAudio is the world's fastest and most efficient audio-language model for on-
 
 Unlike traditional approaches that chain ASR and LLM models together, OmniAudio-2.6B unifies both capabilities in a single efficient architecture for minimal latency and resource overhead.
 
+## Quick Links
+1. Interactive Demo in our [HuggingFace Space](https://huggingface.co/spaces/NexaAIDev/omni-audio-demo)
+2. [Quickstart for local setup](#How-to-Use-On-Device)
+3. Learn more in our [Blogs](https://nexa.ai/blogs/OmniAudio-2.6B)
+
 ## Demo
 
 <video controls autoplay src="https://cdn-uploads.huggingface.co/production/uploads/6618e0424dbef6bd3c72f89a/538_aQ2hRexTlXFL-cYhW.mp4"></video>
@@ -32,10 +37,6 @@ On a 2024 Mac Mini M4 Pro, **Qwen2-Audio-7B-Instruct** running on 🤗 Transform
 * **Recording Summary**: Simply ask "Can you summarize this meeting note?" to convert lengthy recordings into concise, actionable summaries.
 * **Voice Tone Modification**: Transform casual voice memos into professional communications. When you request "Can you make this voice memo more professional?" OmniAudio adjusts the tone while preserving the core message.
 
-## Quick Links
-1. Interactive Demo in our [HuggingFace Space](https://huggingface.co/spaces/NexaAIDev/omni-audio-demo)
-2. [Quickstart for local setup](#How-to-Use-On-Device)
-3. Learn more in our [Blogs](https://nexa.ai/blogs/OmniAudio-2.6B)
 
 ## How to Use On Device
 Step 1: Install Nexa-SDK (local on-device inference framework)