---
title: Audio/Video Translation Toolkit
emoji: 🚀
colorFrom: indigo
colorTo: purple
sdk: gradio
python_version: 3.10.0
app_file: app.py
tags:
  - translation
  - audio
  - video
  - speech-synthesis
  - voice-cloning
  - gradio
---

# 🚀 Audio/Video Translation Toolkit

This application provides a complete pipeline for translating the speech in a video or audio file from English to Tamil, Telugu, or Hindi. It covers every stage: vocal separation, transcription, translation, speech synthesis, and voice cloning.
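
The order of those stages can be sketched as a simple chain of functions. This is only an illustration: the stage names, the `run_pipeline` helper, and the dictionary keys are hypothetical and are not taken from `app.py`, and each stage here is a placeholder that records a dummy result instead of calling a real model.

```python
from typing import Callable, Dict, List, Tuple

# A stage is a (name, function) pair; each function takes and returns the shared state.
Stage = Tuple[str, Callable[[Dict], Dict]]


def run_pipeline(state: Dict, stages: List[Stage]) -> Tuple[Dict, List[str]]:
    """Run each stage in order and return the final state plus the stage names that ran."""
    completed = []
    for name, stage in stages:
        state = stage(state)
        completed.append(name)
    return state, completed


# Placeholder stages (hypothetical): each one only adds a key to the state.
stages: List[Stage] = [
    ("vocal_separation", lambda s: {**s, "vocals": "vocals.wav"}),
    ("transcription", lambda s: {**s, "text_en": "hello"}),
    ("translation", lambda s: {**s, "text_target": "<translated text>"}),
    ("speech_synthesis", lambda s: {**s, "tts_audio": "tts.wav"}),
    ("voice_cloning", lambda s: {**s, "output_audio": "cloned.wav"}),
]

final_state, completed = run_pipeline({"input": "talk.mp4", "target": "Hindi"}, stages)
print(completed)
```

Keeping each stage as a separate function makes it easy to swap one implementation for another, for example a local translation model for an API call, without touching the rest of the chain.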

---

## Key Features 🛠️

* **🎬 Full Video Translation:** Upload a video, and the app will extract the audio, translate it, and merge it back into the original video.
* **🎵 Full Audio Translation:** Translate standalone audio files.
* **🗣️ Vocal Separation:** Isolate vocals from background music before processing.
* **✍️ Transcription & Pace Detection:** Uses Whisper to transcribe the audio and determine the original speaker's pace.
* **🌐 Multi-Lingual Translation:** Translate text to Tamil, Telugu, or Hindi using either local models or the Sarvam API.
* **🔊 Speech Synthesis:** Generate new speech in the target language using models from `ai4bharat`.
* **🧬 Voice Cloning:** Clone the voice from the original speaker onto the newly synthesized audio for a more natural result.
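
As a rough illustration of pace detection, speaking pace can be estimated as words per second over the timed segments of a transcript. The sketch below assumes segments shaped like the `segments` list that the open-source Whisper package returns (dictionaries with `start`, `end`, and `text`); the `speaking_pace` function is a hypothetical helper, and the app may measure pace differently.

```python
def speaking_pace(segments):
    """Return average words per second over segments with a positive duration."""
    total_words = 0
    total_seconds = 0.0
    for seg in segments:
        duration = seg["end"] - seg["start"]
        if duration <= 0:
            continue  # skip malformed or zero-length segments
        total_words += len(seg["text"].split())
        total_seconds += duration
    return total_words / total_seconds if total_seconds else 0.0


# Hand-written sample data in the assumed segment format.
segments = [
    {"start": 0.0, "end": 2.0, "text": " Hello and welcome to the show"},
    {"start": 2.5, "end": 4.5, "text": " Today we talk about translation"},
]

print(speaking_pace(segments))  # 11 words over 4.0 seconds of speech -> 2.75
```

Pauses between segments are excluded here, so the figure reflects how fast the speaker talks while speaking, not the overall density of the recording.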

---

## How to Use the Main Pipeline

1.  Navigate to the **Translate Video** or **Translate Audio** tab.
2.  Upload your file.
3.  Select the **Target Language**.
4.  Click the **Translate** button and wait for the process to complete.

---