A newer version of the Gradio SDK is available:
6.2.0
metadata
title: Audio/Video Translation Toolkit
emoji: π
colorFrom: indigo
colorTo: purple
sdk: gradio
python_version: 3.10.0
app_file: app.py
tags:
- translation
- audio
- video
- speech-synthesis
- voice-cloning
- gradio
π Audio/Video Translation Toolkit
This application provides a complete pipeline for translating the audio of a video or audio file from English to various Indian languages. It handles everything from vocal separation and transcription to translation, speech synthesis, and voice cloning.
## Key Features π οΈ
- π¬ Full Video Translation: Upload a video, and the app will extract the audio, translate it, and merge it back into the original video.
- π΅ Full Audio Translation: Translate standalone audio files.
- π£οΈ Vocal Separation: Isolate vocals from background music before processing.
- βοΈ Transcription & Pace Detection: Uses Whisper to transcribe the audio and determine the original speaker's pace.
- π Multi-Lingual Translation: Translate text to Tamil, Telugu, or Hindi using either local models or the Sarvam API.
- π Speech Synthesis: Generate new speech in the target language using models from
ai4bharat. - 𧬠Voice Cloning: Clone the voice from the original speaker onto the newly synthesized audio for a more natural result.
## How to Use the Main Pipeline
- Navigate to the Translate Video or Translate Audio tab.
- Upload your file.
- Select the Target Language.
- Click the Translate button and wait for the process to complete.