Spaces:
Running
Running
| title: Audio Translator | |
| emoji: π₯ | |
| colorFrom: pink | |
| colorTo: purple | |
| sdk: gradio | |
| sdk_version: 5.31.0 | |
| app_file: app.py | |
| pinned: false | |
| license: apache-2.0 | |
| short_description: Audio Translator | |
| # π£οΈ Audio Translator | |
| [](https://huggingface.co/spaces/<YOUR-USERNAME>/audio-translator) | |
| [] | |
| [] | |
| [] | |
| [] | |
| [](LICENSE) | |
| --- | |
| ## π Overview | |
| Combine **ASR**, **machine translation**, and **neural TTS** into one **seamless audio pipeline**β100 % **CPU** on free-tier HF Spaces. | |
| Upload speech, auto-detect language, translate into English or Spanish, then hear it spoken back. | |
| > **AI buzzwords:** | |
| > β’ Automatic Speech Recognition (ASR) β’ Whisper Tiny β’ Neural Machine Translation β’ GoogleTranslator β’ Text-to-Speech β’ gTTS β’ Multi-modal AI β’ End-to-End Inference β’ Real-Time β’ Edge Deployment | |
| --- | |
| ## β¨ Features | |
| | π Feature | π Description | | |
| |---------------------------|---------------------------------------------------------------| | |
| | **ποΈ ASR: Whisper-Tiny** | Lightning-fast, on-device speech transcription (all languages) | | |
| | **π Translation** | Bidirectional English β Spanish via Deep-Translator | | |
| | **π£οΈ Neural TTS** | High-quality audio playback via the free Google Translate TTS | | |
| | **β‘ Zero-infra CPU** | Runs on 2 vCPU / 16 GB RAMβno GPU or paid APIs needed | | |
| | **π¨ Elegant UI** | Intuitive Gradio Blocksβupload, buttons, transcripts, audio | | |
| | **π§ Fully Modular** | Swap models or add logging/analytics with minimal edits | | |
| --- | |
| ## ποΈ Architecture & Workflow | |
| 1. **Audio Upload** | |
| User uploads any `.wav` or `.mp3` clip. | |
| 2. **ASR** | |
| OpenAIβs `whisper-tiny` decodes speech into text. | |
| 3. **MT** | |
| `deep-translator`βs GoogleTranslator converts text to chosen language. | |
| 4. **TTS** | |
| `gTTS` synthesizes the translated text into an `.mp3`. | |
| 5. **UI Rendering** | |
| Gradio presents the original transcript, the translation, and an audio player. | |
| --- | |
| ## π οΈ Quick Start (Local Dev) | |
| ```bash | |
| git clone https://github.com/<YOUR-USERNAME>/audio-translator.git | |
| cd audio-translator | |
| python3 -m venv venv && source venv/bin/activate | |
| pip install -r requirements.txt | |
| python app.py | |
| ## Latest Update | |
| - Upgraded Whisper-Tiny model for faster ASR. - May 29, 2025 π | |
| - Enhanced gTTS audio quality. π£οΈ - June 05, 2025 π | |
| - Improved translation accuracy for Spanish. ποΈ - June 04, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. π - June 03, 2025 π | |
| - Optimized pipeline for lower latency. π₯ - June 02, 2025 π | |
| - Added support for additional audio formats. π£οΈ - June 01, 2025 π | |
| - Enhanced gTTS audio quality. - May 31, 2025 π | |
| - Improved translation accuracy for Spanish. β‘ - May 30, 2025 π | |
| **Website**: https://ghostainews.com/ | |
| **Discord**: https://discord.gg/BfA23aYz |