Standard Haitian orthography and chunking

by lmselby - opened Jun 28, 2025

Jun 28, 2025

Hello, thank you for sharing your model. It is very difficult to find good ASR for Haitian. I am new to Hugging Face.

I am comparing various ASR approaches I can run in Colaboratory with other types of transcriptions. I am noticing that the results from running 30 second .wav files on your model results in some standard Haitian orthography, some approximation of the sound that does not conform to standard orthography. Could you comment on this challenge, please?

I have a nearly 21-minute, 19 MB .mp3 that I would like to transcribe without manually cutting it into clips in audio-editing software. Could you share the code I would necessarily need to add for chunking the audio to run with your model? Thank you.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment