Upload folder using huggingface_hub
- .gitattributes +2 -0
- README.md +14 -0
- assets/audio/sample_00006_denoised.wav +3 -0
- assets/audio/sample_00006_raw.wav +3 -0
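A commit like this one can be produced with `HfApi.upload_folder` from `huggingface_hub`. The sketch below is only an illustration, not the author's actual script: the local path `./hush` and the `allow_patterns` filter are assumptions, the repo id `weya-ai/hush` is taken from the audio URLs in the README diff, and the commit message is set explicitly to match the title above (it appears to be the library's default as well).

```python
from pathlib import Path


def build_upload_kwargs(local_dir: str, repo_id: str) -> dict:
    """Assemble keyword arguments for HfApi.upload_folder (no network access)."""
    return {
        "folder_path": str(Path(local_dir)),
        "repo_id": repo_id,
        "repo_type": "model",
        "commit_message": "Upload folder using huggingface_hub",
        # Assumed filter: only the README and the audio samples are sent.
        "allow_patterns": ["README.md", "assets/audio/*.wav"],
    }


def upload(local_dir: str, repo_id: str):
    """Push the folder to the Hub; needs huggingface_hub installed and a write token."""
    # Imported lazily so build_upload_kwargs can be used offline.
    from huggingface_hub import HfApi

    return HfApi().upload_folder(**build_upload_kwargs(local_dir, repo_id))


if __name__ == "__main__":
    # "./hush" is a hypothetical local checkout path.
    print(build_upload_kwargs("./hush", "weya-ai/hush"))
```

Calling `upload("./hush", "weya-ai/hush")` would perform the actual push. Keeping the argument assembly separate from the network call makes the configuration easy to inspect before anything is sent.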
.gitattributes
CHANGED
@@ -33,3 +33,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+assets/audio/sample_00006_denoised.wav filter=lfs diff=lfs merge=lfs -text
+assets/audio/sample_00006_raw.wav filter=lfs diff=lfs merge=lfs -text
README.md
CHANGED
@@ -22,6 +22,8 @@ pipeline_tag: audio-to-audio
 
 > **8 MB model · Runs fully on CPU in real time · Trained on 10,000+ hours of mixed audio · Under 1 ms processing per 10 ms of audio**
 
+> 🚀 **Coming Soon:** We are currently fine-tuning a new model optimized specifically for environments with even **louder background noise and louder background speech**! Stay tuned for the upcoming release.
+
 [](https://github.com/pulp-vision/Hush)
 [](LICENSE)
 [](https://python.org)
@@ -29,6 +31,18 @@ pipeline_tag: audio-to-audio
 
 ---
 
+## Listen to the Model (Use headphones)
+
+**Raw Audio (Noisy Environment):**
+
+<audio controls src="https://huggingface.co/weya-ai/hush/resolve/main/assets/audio/sample_00006_raw.wav"></audio>
+
+**Denoised Audio (Hush Output):**
+
+<audio controls src="https://huggingface.co/weya-ai/hush/resolve/main/assets/audio/sample_00006_denoised.wav"></audio>
+
+---
+
 ## Model Overview
 
 Hush is designed from the ground up for **Voice AI applications** — phone-based voice agents, call centre bots, voice assistants, real-time transcription pipelines, and conversational AI systems. It isolates exactly one speaker from a live audio stream, in real time, under production conditions.
assets/audio/sample_00006_denoised.wav
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:8946bbce91695aee41b4c4ea1df3e9e148c6de452f43e4f9bc37e100572d8d4f
+size 160044
assets/audio/sample_00006_raw.wav
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:0d5ad72703bbed8794c04e9e60b86eb66af7d6167931fc3bcf427fd1671e05df
+size 160044
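The two `.wav` entries in this diff are Git LFS pointer files, not the audio itself: each records a spec version, a SHA-256 object id, and a byte size. As a minimal sketch (the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, not part of any library), a pointer can be parsed and a downloaded file checked against it like this:

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into version, hash algorithm, digest, and size."""
    # Each pointer line is "<key> <value>"; split on the first space only.
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """Return True if `data` has the size and SHA-256 digest recorded in the pointer."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


# Pointer for assets/audio/sample_00006_denoised.wav, copied from the diff above.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:8946bbce91695aee41b4c4ea1df3e9e148c6de452f43e4f9bc37e100572d8d4f
size 160044
"""

pointer = parse_lfs_pointer(POINTER)
print(pointer["algo"], pointer["size"])
```

To verify a local copy, read the file as bytes and pass it to `matches_pointer(data, pointer)`. Comparing the size first is a cheap sanity check before the hash comparison decides the result.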