pyannote/speaker-diarization-3.1
Automatic Speech Recognition • Updated • 9.59M • 2.11k
Detect human poses in images and videos
Generate speech from text using a reference voice
✨[With v1.0.0] Accelerated TTS on Kokoro-82M
Fast, efficient, & multilingual text-to-speech
Generate spoken audio from text using Edge TTS
Efficient, fast, and natural text to speech with StyleTTS 2!
High-fidelity Text-To-Speech
Generate realistic speech and sounds from typed text
Generate spoken audio from text using selectable voices
Vote on the latest TTS models!
Transcribe audio files into text instantly
High-quality speech synthesis powered by Kokoro TTS