CoRal-project/roest-v3-whisper-1.5b
Automatic Speech Recognition • 2B • Updated
• 180 • 4
The third generation of our Danish ASR and TTS models.
Note Our large Whisper-based speech recognition model, with the best performance across all demographics.
Note Our small Wav2vec2-based speech recognition model, being 5x smaller than the large one, and which does not hallucinate.
Note Our large Chatterbox-based speech synthesis model, with the best quality speech.
Note Our small Chatterbox-based speech synthesis model, more suitable for real-time applications.
Note Our dataset used to train our speech recognition models.