F5-TTS
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Generate talking avatars from text-to-speech
Generate speech in a cloned voice from reference audio
Transcribe or translate audio and YouTube videos to text
Identify emotion from multilingual audio
Combine voice cloning and portrait lip-sync animation
Voice conversion framework based on VITS
Transcribe audio files to readable text instantly
Transcribe audio to text with speaker diarization
Combine and process audio files with effects
Transcribe audio files into text
Generate subtitled videos from YouTube links
Convert audio to subtitles
Generate Cantonese speech from text
Generate high-quality speech from text using an audio prompt