High-quality, efficient voice cloning. Just 100M parameters.
All-in-one hub of general-purpose tools useful for any agent
CPU - Gradio. Old smol TTS champ. 54 voices.
Local Whisper, but for the current year
Generate speech from text using multiple TTS services
Chat with AI using text and images
Generate images from text prompts