---
title: Audio/Video Translation Toolkit
emoji: 🚀
colorFrom: indigo
colorTo: purple
sdk: gradio
python_version: 3.10.0
app_file: app.py
tags:
  - translation
  - audio
  - video
  - speech-synthesis
  - voice-cloning
  - gradio
---

# 🚀 Audio/Video Translation Toolkit

This application provides a complete pipeline for translating the spoken audio of a video or audio file from English into Tamil, Telugu, or Hindi. It covers every stage: vocal separation, transcription, translation, speech synthesis, and voice cloning.


## Key Features 🛠️

  • 🎬 Full Video Translation: Upload a video, and the app will extract the audio, translate it, and merge it back into the original video.
  • 🎵 Full Audio Translation: Translate standalone audio files.
  • 🗣️ Vocal Separation: Isolate vocals from background music before processing.
  • ✍️ Transcription & Pace Detection: Uses Whisper to transcribe the audio and determine the original speaker's pace.
  • 🌐 Multi-Lingual Translation: Translate text to Tamil, Telugu, or Hindi using either local models or the Sarvam API.
  • 🔊 Speech Synthesis: Generate new speech in the target language using models from ai4bharat.
  • 🧬 Voice Cloning: Clone the voice from the original speaker onto the newly synthesized audio for a more natural result.
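As an illustration of the pace detection mentioned above, here is a minimal sketch of how a speaking rate could be estimated from timestamped transcript segments. It assumes each segment is a mapping with `start`, `end`, and `text` keys, which is the shape of the `segments` list in the result returned by the open-source `whisper` package's `transcribe` method. The function name `speaking_pace_wps` and the words-per-second metric are assumptions made for this example; the app's actual pace calculation may differ.

```python
from typing import Iterable, Mapping


def speaking_pace_wps(segments: Iterable[Mapping]) -> float:
    """Estimate speaking pace in words per second.

    Hypothetical helper: only time inside segments is counted, so
    silences between segments do not lower the estimate.
    """
    total_words = 0
    total_seconds = 0.0
    for seg in segments:
        duration = float(seg["end"]) - float(seg["start"])
        if duration <= 0:
            continue  # skip empty or malformed segments
        total_words += len(str(seg["text"]).split())
        total_seconds += duration
    return total_words / total_seconds if total_seconds else 0.0


# Hand-written segments in the assumed shape; not real model output.
example_segments = [
    {"start": 0.0, "end": 2.0, "text": " Hello and welcome to the demo."},
    {"start": 2.5, "end": 4.5, "text": " Let us get started."},
]
pace = speaking_pace_wps(example_segments)
```

A pace estimate like this could then be used to speed up or slow down the synthesized speech so that it roughly fits the original timing.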

## How to Use the Main Pipeline

  1. Navigate to the Translate Video or Translate Audio tab.
  2. Upload your file.
  3. Select the Target Language.
  4. Click the Translate button and wait for the process to complete.
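For readers curious about what happens at the two ends of the video pipeline, the sketch below builds `ffmpeg` command lines for extracting the audio track and for merging a translated track back into the original video. This is an assumption about how such a step could be done, not a description of the app's code: the helper names `extract_audio_cmd` and `merge_audio_cmd`, the WAV output format, and the 16 kHz sample rate are all chosen for illustration.

```python
from pathlib import Path
from typing import List


def extract_audio_cmd(video: Path, wav_out: Path, sample_rate: int = 16000) -> List[str]:
    """Build an ffmpeg command that writes the video's audio as 16-bit PCM WAV.

    Hypothetical helper; the sample rate default is an assumption.
    """
    return [
        "ffmpeg", "-y",            # overwrite the output if it exists
        "-i", str(video),
        "-vn",                     # drop the video stream
        "-ar", str(sample_rate),   # resample the audio
        "-c:a", "pcm_s16le",       # uncompressed 16-bit PCM
        str(wav_out),
    ]


def merge_audio_cmd(video: Path, new_audio: Path, video_out: Path) -> List[str]:
    """Build an ffmpeg command that swaps in a new audio track (hypothetical helper)."""
    return [
        "ffmpeg", "-y",
        "-i", str(video),
        "-i", str(new_audio),
        "-map", "0:v:0",           # video from the first input
        "-map", "1:a:0",           # audio from the second input
        "-c:v", "copy",            # keep the video stream as-is, no re-encode
        "-shortest",               # stop at the shorter of the two streams
        str(video_out),
    ]


extract = extract_audio_cmd(Path("talk.mp4"), Path("talk.wav"))
merge = merge_audio_cmd(Path("talk.mp4"), Path("talk_hi.wav"), Path("talk_hi.mp4"))
```

Each list can be passed to `subprocess.run(cmd, check=True)` on a machine where `ffmpeg` is installed. Copying the video stream avoids re-encoding, so only the audio changes.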