Upload folder using huggingface_hub

Files changed (4) hide show

.gitattributes CHANGED Viewed

@@ -33,3 +33,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+assets/audio/sample_00006_denoised.wav filter=lfs diff=lfs merge=lfs -text
+assets/audio/sample_00006_raw.wav filter=lfs diff=lfs merge=lfs -text

README.md CHANGED Viewed

@@ -22,6 +22,8 @@ pipeline_tag: audio-to-audio
 > **8 MB model · Runs fully on CPU in real time · Trained on 10,000+ hours of mixed audio · Under 1 ms processing per 10 ms of audio**
 [![GitHub](https://img.shields.io/badge/GitHub-Repository-blue?logo=github)](https://github.com/pulp-vision/Hush)
 [![License](https://img.shields.io/badge/license-Apache%202.0-blue.svg)](LICENSE)
 [![Python](https://img.shields.io/badge/python-3.9%2B-blue.svg)](https://python.org)
@@ -29,6 +31,18 @@ pipeline_tag: audio-to-audio
 ---
 ## Model Overview
 Hush is designed from the ground up for **Voice AI applications** — phone-based voice agents, call centre bots, voice assistants, real-time transcription pipelines, and conversational AI systems. It isolates exactly one speaker from a live audio stream, in real time, under production conditions.

 > **8 MB model · Runs fully on CPU in real time · Trained on 10,000+ hours of mixed audio · Under 1 ms processing per 10 ms of audio**
+> 🚀 **Coming Soon:** We are currently fine-tuning a new model optimized specifically for environments with even **louder background noise and louder background speech**! Stay tuned for the upcoming release.
 [![GitHub](https://img.shields.io/badge/GitHub-Repository-blue?logo=github)](https://github.com/pulp-vision/Hush)
 [![License](https://img.shields.io/badge/license-Apache%202.0-blue.svg)](LICENSE)
 [![Python](https://img.shields.io/badge/python-3.9%2B-blue.svg)](https://python.org)
 ---
+## Listen to the Model (Use headphones)
+**Raw Audio (Noisy Environment):**
+<audio controls src="https://huggingface.co/weya-ai/hush/resolve/main/assets/audio/sample_00006_raw.wav"></audio>
+**Denoised Audio (Hush Output):**
+<audio controls src="https://huggingface.co/weya-ai/hush/resolve/main/assets/audio/sample_00006_denoised.wav"></audio>
+---
 ## Model Overview
 Hush is designed from the ground up for **Voice AI applications** — phone-based voice agents, call centre bots, voice assistants, real-time transcription pipelines, and conversational AI systems. It isolates exactly one speaker from a live audio stream, in real time, under production conditions.

assets/audio/sample_00006_denoised.wav ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:8946bbce91695aee41b4c4ea1df3e9e148c6de452f43e4f9bc37e100572d8d4f
+size 160044

assets/audio/sample_00006_raw.wav ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:0d5ad72703bbed8794c04e9e60b86eb66af7d6167931fc3bcf427fd1671e05df
+size 160044