text-to-speech
updated
FlashSpeech: Efficient Zero-Shot Speech Synthesis
Paper
• 2404.14700
• Published • 32
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
Paper
• 2306.15687
• Published • 1
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and
Diffusion Models
Paper
• 2403.03100
• Published • 37
Tango 2: Aligning Diffusion-based Text-to-Audio Generations through
Direct Preference Optimization
Paper
• 2404.09956
• Published • 11
Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech
Prompts
Paper
• 2307.07218
• Published • 28
Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive
Bias
Paper
• 2306.03509
• Published • 5
parler-tts/dac_44khZ_8kbps
76.7M • Updated • 441
• 19
parler-tts/parler_tts_mini_v0.1
Text-to-Speech
• 0.6B • Updated • 4.25k
• 358
Wenetspeech4TTS/WenetSpeech4TTS
Updated • 985
• 86
Text-to-Audio
• Updated • 1
• 9
Feature Extraction
• 96.2M • Updated • 928k
• • 299
Text-to-Speech
• Updated • 9.54M
• • 6.11k
Text-to-Speech
• 4B • Updated • 171
• 526
Text-to-Speech
• Updated • 2.31k
• 1.1k
stepfun-ai/Step-Audio-TTS-3B
Text-to-Speech
• 4B • Updated • 110
• 197
Text-to-Speech
• Updated • 131
• 417
Text-to-Speech
• Updated • 53.6k
• • 2.86k