Speech Recognition Models
Collection
Models for Welsh language and bilingual speech recognition β’ 6 items β’ Updated
CTranslate2 int8 quantised version of techiaith/whisper-large-ft-cy-en.
This model provides faster inference with lower memory usage. See the source model for full details on training data, evaluation results, and usage.
import faster_whisper
model = faster_whisper.WhisperModel("techiaith/whisper-large-ft-cy-en-ct2")
# Welsh transcription
segments, info = model.transcribe("welsh_audio.wav", language="cy", task="transcribe")
# Welsh to English translation
segments, info = model.transcribe("welsh_audio.wav", language="cy", task="translate")