---
title: Audio/Video Translation Toolkit
emoji: π
colorFrom: indigo
colorTo: purple
sdk: gradio
python_version: 3.10.0
app_file: app.py
tags:
- translation
- audio
- video
- speech-synthesis
- voice-cloning
- gradio
---
# Audio/Video Translation Toolkit
This application provides a complete pipeline for translating the audio track of a video or audio file from English into Tamil, Telugu, or Hindi. It covers every stage: vocal separation, transcription, translation, speech synthesis, and voice cloning.
---
## Key Features
* **Full Video Translation:** Upload a video, and the app will extract the audio, translate it, and merge it back into the original video.
* **Full Audio Translation:** Translate standalone audio files.
* **Vocal Separation:** Isolate vocals from background music before processing.
* **Transcription & Pace Detection:** Uses Whisper to transcribe the audio and determine the original speaker's pace.
* **Multi-Lingual Translation:** Translate text to Tamil, Telugu, or Hindi using either local models or the Sarvam API.
* **Speech Synthesis:** Generate new speech in the target language using models from `ai4bharat`.
* **Voice Cloning:** Clone the voice from the original speaker onto the newly synthesized audio for a more natural result.
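
The README does not say how the speaker's pace is measured. As a rough illustration only, the sketch below assumes pace means words per second of actual speech, and that the transcript arrives as a list of segments with `start`, `end`, and `text` keys (the shape of the `segments` list returned by the open-source `whisper` package's `transcribe`). The function name `speaking_pace` is hypothetical and not taken from `app.py`.

```python
from typing import Dict, List


def speaking_pace(segments: List[Dict]) -> float:
    """Return words per second, counting only the time covered by segments.

    Hypothetical helper: the app's real pace metric is not documented.
    """
    total_words = 0
    total_seconds = 0.0
    for seg in segments:
        duration = seg["end"] - seg["start"]
        if duration <= 0:
            # Skip malformed or zero-length segments.
            continue
        total_words += len(seg["text"].split())
        total_seconds += duration
    return total_words / total_seconds if total_seconds else 0.0


# Example segments in the assumed Whisper-style shape.
segments = [
    {"start": 0.0, "end": 2.0, "text": "Hello and welcome back"},
    {"start": 2.5, "end": 4.5, "text": "to the channel"},
]
print(speaking_pace(segments))
```

A value like this could then be used to choose a speaking rate for the synthesized speech so that the translated audio roughly fits the original timing.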
---
## How to Use the Main Pipeline
1. Navigate to the **Translate Video** or **Translate Audio** tab.
2. Upload your file.
3. Select the **Target Language**.
4. Click the **Translate** button and wait for the process to complete.
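
Conceptually, clicking **Translate** runs the stages from the feature list in order on the uploaded file. The sketch below is one way to picture that flow, not the code in `app.py`: `run_pipeline`, `make_stub`, the stage names, and the `state` dictionary are all hypothetical, and the stages are stubs that only record that they ran. The language codes are the standard ISO 639-1 codes for the three supported targets.

```python
from typing import Callable, Dict, List

# ISO 639-1 codes for the target languages listed in this README.
LANGUAGE_CODES = {"Tamil": "ta", "Telugu": "te", "Hindi": "hi"}

Stage = Callable[[Dict], Dict]


def run_pipeline(path: str, target_language: str, stages: List[Stage]) -> Dict:
    """Validate the target language, then pass a shared state through each stage."""
    if target_language not in LANGUAGE_CODES:
        raise ValueError(f"Unsupported target language: {target_language}")
    state = {
        "input_path": path,
        "target_lang": LANGUAGE_CODES[target_language],
        "log": [],
    }
    for stage in stages:
        state = stage(state)
    return state


def make_stub(name: str) -> Stage:
    """Build a placeholder stage that only records its own name."""
    def stage(state: Dict) -> Dict:
        state["log"].append(name)
        return state
    return stage


# Hypothetical stage names mirroring the feature list above.
STAGE_NAMES = ["separate_vocals", "transcribe", "translate", "synthesize", "clone_voice"]

result = run_pipeline("talk.wav", "Tamil", [make_stub(n) for n in STAGE_NAMES])
print(result["target_lang"], result["log"])
```

Keeping each stage as a function over a shared state makes it easy to swap one implementation for another, for example a local translation model versus an API call, without touching the rest of the chain.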
---