Spaces:
Sleeping
Sleeping
| title: Audio Translator | |
| emoji: π₯ | |
| colorFrom: pink | |
| colorTo: purple | |
| sdk: gradio | |
| sdk_version: 5.31.0 | |
| app_file: app.py | |
| pinned: false | |
| license: apache-2.0 | |
| short_description: Audio Translator | |
| # π£οΈ Audio Translator | |
| [](https://huggingface.co/spaces/<YOUR-USERNAME>/audio-translator) | |
| [] | |
| [] | |
| [] | |
| [] | |
| [](LICENSE) | |
| --- | |
| ## π Overview | |
| Combine **ASR**, **machine translation**, and **neural TTS** into one **seamless audio pipeline**β100 % **CPU** on free-tier HF Spaces. | |
| Upload speech, auto-detect language, translate into English or Spanish, then hear it spoken back. | |
| > **AI buzzwords:** | |
| > β’ Automatic Speech Recognition (ASR) β’ Whisper Tiny β’ Neural Machine Translation β’ GoogleTranslator β’ Text-to-Speech β’ gTTS β’ Multi-modal AI β’ End-to-End Inference β’ Real-Time β’ Edge Deployment | |
| --- | |
| ## β¨ Features | |
| | π Feature | π Description | | |
| |---------------------------|---------------------------------------------------------------| | |
| | **ποΈ ASR: Whisper-Tiny** | Lightning-fast, on-device speech transcription (all languages) | | |
| | **π Translation** | Bidirectional English β Spanish via Deep-Translator | | |
| | **π£οΈ Neural TTS** | High-quality audio playback via the free Google Translate TTS | | |
| | **β‘ Zero-infra CPU** | Runs on 2 vCPU / 16 GB RAMβno GPU or paid APIs needed | | |
| | **π¨ Elegant UI** | Intuitive Gradio Blocksβupload, buttons, transcripts, audio | | |
| | **π§ Fully Modular** | Swap models or add logging/analytics with minimal edits | | |
| --- | |
| ## ποΈ Architecture & Workflow | |
| 1. **Audio Upload** | |
| User uploads any `.wav` or `.mp3` clip. | |
| 2. **ASR** | |
| OpenAIβs `whisper-tiny` decodes speech into text. | |
| 3. **MT** | |
| `deep-translator`βs GoogleTranslator converts text to chosen language. | |
| 4. **TTS** | |
| `gTTS` synthesizes the translated text into an `.mp3`. | |
| 5. **UI Rendering** | |
| Gradio presents the original transcript, the translation, and an audio player. | |
| --- | |
| ## π οΈ Quick Start (Local Dev) | |
| ```bash | |
| git clone https://github.com/<YOUR-USERNAME>/audio-translator.git | |
| cd audio-translator | |
| python3 -m venv venv && source venv/bin/activate | |
| pip install -r requirements.txt | |
| python app.py | |
| ## Latest Update | |
| - Upgraded Whisper-Tiny model for faster ASR. - May 29, 2025 π | |
| - Enhanced gTTS audio quality. π - December 11, 2025 π | |
| - Improved translation accuracy for Spanish. π₯ - December 09, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. β‘ - December 06, 2025 π | |
| - Optimized pipeline for lower latency. - December 04, 2025 π | |
| - Added support for additional audio formats. - December 01, 2025 π | |
| - Enhanced gTTS audio quality. π - November 29, 2025 π | |
| - Improved translation accuracy for Spanish. π£οΈ - November 26, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. - November 24, 2025 π | |
| - Optimized pipeline for lower latency. ποΈ - November 21, 2025 π | |
| - Added support for additional audio formats. π£οΈ - November 19, 2025 π | |
| - Enhanced gTTS audio quality. - November 17, 2025 π | |
| - Improved translation accuracy for Spanish. - November 15, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. - November 11, 2025 π | |
| - Optimized pipeline for lower latency. π₯ - November 10, 2025 π | |
| - Added support for additional audio formats. - November 08, 2025 π | |
| - Enhanced gTTS audio quality. β‘ - November 06, 2025 π | |
| - Improved translation accuracy for Spanish. - November 05, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. π - November 03, 2025 π | |
| - Optimized pipeline for lower latency. ποΈ - November 01, 2025 π | |
| - Added support for additional audio formats. - October 29, 2025 π | |
| - Enhanced gTTS audio quality. π₯ - October 27, 2025 π | |
| - Improved translation accuracy for Spanish. - October 24, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. - October 22, 2025 π | |
| - Optimized pipeline for lower latency. - October 19, 2025 π | |
| - Added support for additional audio formats. - October 17, 2025 π | |
| - Enhanced gTTS audio quality. - October 14, 2025 π | |
| - Improved translation accuracy for Spanish. π£οΈ - October 12, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. π - October 11, 2025 π | |
| - Optimized pipeline for lower latency. ποΈ - October 10, 2025 π | |
| - Added support for additional audio formats. β‘ - October 08, 2025 π | |
| - Enhanced gTTS audio quality. - October 06, 2025 π | |
| - Improved translation accuracy for Spanish. π₯ - October 05, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. π - October 03, 2025 π | |
| - Optimized pipeline for lower latency. ποΈ - October 01, 2025 π | |
| - Added support for additional audio formats. - September 30, 2025 π | |
| - Enhanced gTTS audio quality. - September 28, 2025 π | |
| - Improved translation accuracy for Spanish. β‘ - September 26, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. π£οΈ - September 25, 2025 π | |
| - Optimized pipeline for lower latency. - September 23, 2025 π | |
| - Added support for additional audio formats. - September 21, 2025 π | |
| - Enhanced gTTS audio quality. π - September 20, 2025 π | |
| - Improved translation accuracy for Spanish. - September 18, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. π₯ - September 16, 2025 π | |
| - Optimized pipeline for lower latency. - September 15, 2025 π | |
| - Added support for additional audio formats. ποΈ - September 13, 2025 π | |
| - Enhanced gTTS audio quality. β‘ - September 11, 2025 π | |
| - Improved translation accuracy for Spanish. - September 10, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. - September 08, 2025 π | |
| - Optimized pipeline for lower latency. - September 06, 2025 π | |
| - Added support for additional audio formats. - September 05, 2025 π | |
| - Enhanced gTTS audio quality. π£οΈ - September 03, 2025 π | |
| - Improved translation accuracy for Spanish. β‘ - September 01, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. - August 31, 2025 π | |
| - Optimized pipeline for lower latency. - August 29, 2025 π | |
| - Added support for additional audio formats. - August 27, 2025 π | |
| - Enhanced gTTS audio quality. π£οΈ - August 26, 2025 π | |
| - Improved translation accuracy for Spanish. π₯ - August 24, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. ποΈ - August 22, 2025 π | |
| - Optimized pipeline for lower latency. - August 21, 2025 π | |
| - Added support for additional audio formats. π - August 19, 2025 π | |
| - Enhanced gTTS audio quality. β‘ - August 17, 2025 π | |
| - Improved translation accuracy for Spanish. - August 16, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. ποΈ - August 15, 2025 π | |
| - Optimized pipeline for lower latency. - August 14, 2025 π | |
| - Added support for additional audio formats. π£οΈ - August 13, 2025 π | |
| - Enhanced gTTS audio quality. π - August 12, 2025 π | |
| - Improved translation accuracy for Spanish. π₯ - August 11, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. - August 10, 2025 π | |
| - Optimized pipeline for lower latency. - August 09, 2025 π | |
| - Added support for additional audio formats. π₯ - August 08, 2025 π | |
| - Enhanced gTTS audio quality. ποΈ - August 07, 2025 π | |
| - Improved translation accuracy for Spanish. β‘ - August 06, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. π£οΈ - August 05, 2025 π | |
| - Optimized pipeline for lower latency. - August 04, 2025 π | |
| - Added support for additional audio formats. π - August 03, 2025 π | |
| - Enhanced gTTS audio quality. - August 02, 2025 π | |
| - Improved translation accuracy for Spanish. ποΈ - August 01, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. - July 31, 2025 π | |
| - Optimized pipeline for lower latency. π£οΈ - July 30, 2025 π | |
| - Added support for additional audio formats. π₯ - July 29, 2025 π | |
| - Enhanced gTTS audio quality. β‘ - July 28, 2025 π | |
| - Improved translation accuracy for Spanish. - July 27, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. - July 26, 2025 π | |
| - Optimized pipeline for lower latency. - July 25, 2025 π | |
| - Added support for additional audio formats. π - July 24, 2025 π | |
| - Enhanced gTTS audio quality. - July 23, 2025 π | |
| - Improved translation accuracy for Spanish. ποΈ - July 22, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. - July 21, 2025 π | |
| - Optimized pipeline for lower latency. π£οΈ - July 20, 2025 π | |
| - Added support for additional audio formats. π₯ - July 19, 2025 π | |
| - Enhanced gTTS audio quality. π - July 18, 2025 π | |
| - Improved translation accuracy for Spanish. β‘ - July 17, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. - July 16, 2025 π | |
| - Optimized pipeline for lower latency. - July 15, 2025 π | |
| - Added support for additional audio formats. π - July 11, 2025 π | |
| - Enhanced gTTS audio quality. - July 10, 2025 π | |
| - Improved translation accuracy for Spanish. - July 09, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. β‘ - July 08, 2025 π | |
| - Optimized pipeline for lower latency. π£οΈ - July 07, 2025 π | |
| - Added support for additional audio formats. - July 06, 2025 π | |
| - Enhanced gTTS audio quality. π₯ - July 05, 2025 π | |
| - Improved translation accuracy for Spanish. ποΈ - July 04, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. - July 03, 2025 π | |
| - Optimized pipeline for lower latency. - July 02, 2025 π | |
| - Added support for additional audio formats. - July 01, 2025 π | |
| - Enhanced gTTS audio quality. - June 30, 2025 π | |
| - Improved translation accuracy for Spanish. β‘ - June 29, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. - June 28, 2025 π | |
| - Optimized pipeline for lower latency. - June 27, 2025 π | |
| - Added support for additional audio formats. - June 26, 2025 π | |
| - Enhanced gTTS audio quality. π - June 25, 2025 π | |
| - Improved translation accuracy for Spanish. - June 24, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. π£οΈ - June 23, 2025 π | |
| - Optimized pipeline for lower latency. π₯ - June 22, 2025 π | |
| - Added support for additional audio formats. ποΈ - June 21, 2025 π | |
| - Enhanced gTTS audio quality. - June 20, 2025 π | |
| - Improved translation accuracy for Spanish. β‘ - June 19, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. - June 18, 2025 π | |
| - Optimized pipeline for lower latency. π - June 17, 2025 π | |
| - Added support for additional audio formats. ποΈ - June 16, 2025 π | |
| - Enhanced gTTS audio quality. - June 15, 2025 π | |
| - Improved translation accuracy for Spanish. π£οΈ - June 14, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. - June 13, 2025 π | |
| - Optimized pipeline for lower latency. π₯ - June 12, 2025 π | |
| - Added support for additional audio formats. β‘ - June 11, 2025 π | |
| - Enhanced gTTS audio quality. - June 10, 2025 π | |
| - Improved translation accuracy for Spanish. ποΈ - June 09, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. - June 08, 2025 π | |
| - Optimized pipeline for lower latency. π₯ - June 07, 2025 π | |
| - Added support for additional audio formats. π - June 06, 2025 π | |
| - Enhanced gTTS audio quality. π£οΈ - June 05, 2025 π | |
| - Improved translation accuracy for Spanish. ποΈ - June 04, 2025 π | |
| - Upgraded Whisper-Tiny model for faster ASR. π - June 03, 2025 π | |
| - Optimized pipeline for lower latency. π₯ - June 02, 2025 π | |
| - Added support for additional audio formats. π£οΈ - June 01, 2025 π | |
| - Enhanced gTTS audio quality. - May 31, 2025 π | |
| - Improved translation accuracy for Spanish. β‘ - May 30, 2025 π | |
| **Website**: https://ghostainews.com/ | |
| **Discord**: https://discord.gg/BfA23aYz |