--- title: Audio Translator emoji: πŸ”₯ colorFrom: pink colorTo: purple sdk: gradio sdk_version: 5.31.0 app_file: app.py pinned: false license: apache-2.0 short_description: Audio Translator --- # πŸ—£οΈ Audio Translator [![Hugging Face Space](https://img.shields.io/badge/HuggingFace-Spaces-blue?logo=huggingface)](https://huggingface.co/spaces//audio-translator) [![Gradio UI](https://img.shields.io/badge/Gradio-5.31.0-brightgreen?logo=gradio)] [![Model: Whisper Tiny](https://img.shields.io/badge/ASR-Whisper--tiny-orange)] [![Translator: Deep-Translator](https://img.shields.io/badge/Translator-GoogleTranslator-blue)] [![TTS: gTTS](https://img.shields.io/badge/TTS-gTTS-yellow)] [![License](https://img.shields.io/badge/License-MIT-lightgrey)](LICENSE) --- ## πŸš€ Overview Combine **ASR**, **machine translation**, and **neural TTS** into one **seamless audio pipeline**β€”100 % **CPU** on free-tier HF Spaces. Upload speech, auto-detect language, translate into English or Spanish, then hear it spoken back. > **AI buzzwords:** > β€’ Automatic Speech Recognition (ASR) β€’ Whisper Tiny β€’ Neural Machine Translation β€’ GoogleTranslator β€’ Text-to-Speech β€’ gTTS β€’ Multi-modal AI β€’ End-to-End Inference β€’ Real-Time β€’ Edge Deployment --- ## ✨ Features | πŸ”‘ Feature | πŸ” Description | |---------------------------|---------------------------------------------------------------| | **πŸŽ™οΈ ASR: Whisper-Tiny** | Lightning-fast, on-device speech transcription (all languages) | | **🌐 Translation** | Bidirectional English ↔ Spanish via Deep-Translator | | **πŸ—£οΈ Neural TTS** | High-quality audio playback via the free Google Translate TTS | | **⚑ Zero-infra CPU** | Runs on 2 vCPU / 16 GB RAMβ€”no GPU or paid APIs needed | | **🎨 Elegant UI** | Intuitive Gradio Blocksβ€”upload, buttons, transcripts, audio | | **πŸ”§ Fully Modular** | Swap models or add logging/analytics with minimal edits | --- ## πŸ—οΈ Architecture & Workflow 1. **Audio Upload** User uploads any `.wav` or `.mp3` clip. 2. **ASR** OpenAI’s `whisper-tiny` decodes speech into text. 3. **MT** `deep-translator`’s GoogleTranslator converts text to chosen language. 4. **TTS** `gTTS` synthesizes the translated text into an `.mp3`. 5. **UI Rendering** Gradio presents the original transcript, the translation, and an audio player. --- ## πŸ› οΈ Quick Start (Local Dev) ```bash git clone https://github.com//audio-translator.git cd audio-translator python3 -m venv venv && source venv/bin/activate pip install -r requirements.txt python app.py ## Latest Update - Upgraded Whisper-Tiny model for faster ASR. - May 29, 2025 πŸ“ - Enhanced gTTS audio quality. πŸ—£οΈ - June 05, 2025 πŸ“ - Improved translation accuracy for Spanish. πŸŽ™οΈ - June 04, 2025 πŸ“ - Upgraded Whisper-Tiny model for faster ASR. 🌐 - June 03, 2025 πŸ“ - Optimized pipeline for lower latency. πŸ”₯ - June 02, 2025 πŸ“ - Added support for additional audio formats. πŸ—£οΈ - June 01, 2025 πŸ“ - Enhanced gTTS audio quality. - May 31, 2025 πŸ“ - Improved translation accuracy for Spanish. ⚑ - May 30, 2025 πŸ“ **Website**: https://ghostainews.com/ **Discord**: https://discord.gg/BfA23aYz