Tes
Translate timed word data into subtitle lines with speed analysis
Translate timed word data into subtitle lines with speed analysis
Identify speakers in an audio transcript
Transcribe audio to word-level timestamps
Generate SRT subtitles from audio and timestamps
Align transcript words to audio and get timestamps
Decode Whisper encoder output into timed subtitles
Extract vocals from any song in seconds
Extract vocals from any audio file
Translate English SRT subtitles to Hindi instantly
Kokoro, But It Clones Voices Now
Clone a voice to speak custom text
RVC-v2 Beatrice-v2 - CPU inference + training
RVC-v2 Beatrice-v2 - CPU inference + training
Clone a voice from a sample to speak any text
Clone a voice to speak your text
Generate wordβlevel transcript with timestamps from audio
Transcribe audio with speaker diarization
Transcribe audio files to text with batch processing
Generate custom voice audio from text and a speaker sample