F5-TTS
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Generate talking avatars from text-to-speech
Generate speech in a cloned voice
Transcribe speech from audio or YouTube videos into text
Identify emotion from multilingual audio
Combine voice cloning and portrait lip-sync animation
Voice conversion framework based on VITS
Transcribe audio files to text instantly
Transcribe audio to text with speaker diarization
Combine and process audio files with effects
Transcribe audio files into text
Generate subtitled videos from YouTube links
Convert audio to subtitles
Generate Cantonese speech from text
Generate high-quality speech from text using a prompt audio