Personal speech-to-text model

Speech-to-text models often do not understand my accent, so I fine-tuned this one from "distil-whisper/distil-medium.en" on about 1000 recordings of my voice, totaling about 2 hours of audio. The word error rate (WER) goes from ~12% to ~8%.
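For readers unfamiliar with the metric, here is a minimal sketch of how WER is commonly computed: the word-level edit distance (substitutions, insertions, deletions) divided by the number of words in the reference transcript. The function name `word_error_rate` and the lowercase-and-split normalization are assumptions for illustration, not necessarily the evaluation used for the numbers above.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    # Assumed normalization: lowercase and split on whitespace.
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# One substitution ("sat" -> "sit") and one deletion ("the") over 6 words.
print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

Under this definition, a drop from ~12% to ~8% means roughly one fewer wrong word per 25 words spoken.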

Do not download unless you have exactly my accent (North-East Italy).

Model size: 0.4B params, stored as Safetensors in F32.