Spaces:
Paused
Paused
Update README.md
Browse files
README.md
CHANGED
|
@@ -1,5 +1,5 @@
|
|
| 1 |
---
|
| 2 |
-
title:
|
| 3 |
emoji: 🎵
|
| 4 |
colorFrom: blue
|
| 5 |
colorTo: green
|
|
@@ -9,50 +9,5 @@ app_file: app.py
|
|
| 9 |
pinned: false
|
| 10 |
---
|
| 11 |
|
| 12 |
-
#
|
| 13 |
-
|
| 14 |
-
**Professional 3-Department Audio Processing Pipeline**
|
| 15 |
-
Runs 100% free on Hugging Face ZeroGPU (A10G · 24 GB VRAM)
|
| 16 |
-
|
| 17 |
-
## What It Does
|
| 18 |
-
|
| 19 |
-
Upload any audio file and ClearWave AI runs it through three AI departments:
|
| 20 |
-
|
| 21 |
-
| Dept | Model | What it does |
|
| 22 |
-
|------|-------|--------------|
|
| 23 |
-
| 🎙️ Denoiser | DeepFilterNet3 | Removes background noise, EBU R128 normalisation |
|
| 24 |
-
| 📝 Transcriber | Groq Whisper large-v3 | Speech-to-text, 10-20x faster than local Whisper |
|
| 25 |
-
| 🌐 Translator | NLLB-200-distilled-600M | Offline translation, 200 languages |
|
| 26 |
-
|
| 27 |
-
**Example:**
|
| 28 |
-
```
|
| 29 |
-
Input : English audio "Hello this is a test"
|
| 30 |
-
Original (EN) : Hello this is a test
|
| 31 |
-
Translated (TE): హలో ఇది ఒక పరీక్ష
|
| 32 |
-
Total time : ~6 seconds
|
| 33 |
-
```
|
| 34 |
-
|
| 35 |
-
## Setting Your Groq API Key
|
| 36 |
-
|
| 37 |
-
1. Get a free key at [console.groq.com](https://console.groq.com)
|
| 38 |
-
2. In your Space: **Settings → Variables and secrets → New secret**
|
| 39 |
-
3. Name: `GROQ_API_KEY`, Value: your key (`gsk_...`)
|
| 40 |
-
4. Save — Space restarts automatically
|
| 41 |
-
|
| 42 |
-
Without a key, the app falls back to local Whisper small (still works, slower).
|
| 43 |
-
|
| 44 |
-
## How to Use
|
| 45 |
-
|
| 46 |
-
1. Upload any audio file (MP3, WAV, AAC, OGG, M4A, FLAC, M4A, OPUS...)
|
| 47 |
-
2. Set Input Language (or leave as Auto Detect)
|
| 48 |
-
3. Set Output Language
|
| 49 |
-
4. Click **Process Audio**
|
| 50 |
-
5. View results in the Text Results, Clean Audio, and Timings tabs
|
| 51 |
-
|
| 52 |
-
## Supported Languages
|
| 53 |
-
|
| 54 |
-
English · Telugu · Hindi · Tamil · Kannada (+ 195 more via NLLB-200)
|
| 55 |
-
|
| 56 |
-
## Cost
|
| 57 |
-
|
| 58 |
-
**$0** — Hugging Face ZeroGPU + Groq free tier (14,400s audio/day)
|
|
|
|
| 1 |
---
|
| 2 |
+
title: ClearWave AI
|
| 3 |
emoji: 🎵
|
| 4 |
colorFrom: blue
|
| 5 |
colorTo: green
|
|
|
|
| 9 |
pinned: false
|
| 10 |
---
|
| 11 |
|
| 12 |
+
# ClearWave AI
|
| 13 |
+
3-Department Audio Pipeline: Denoise → Transcribe → Translate
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|