hexgrad/Kokoro-82M
Text-to-Speech β’ Updated β’ 9.51M β’ β’ 6.11k
Generate high-quality speech from text using a prompt audio
Generate animated videos from images and motion sequences
Generate images from text or an input picture
View the LMArena model leaderboard