DTU54DL/common-accent
Viewer • Updated • 10.5k • 207 • 10
How to use Wishwa98/ASRForCommonVoice with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="Wishwa98/ASRForCommonVoice") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("Wishwa98/ASRForCommonVoice")
model = AutoModelForSpeechSeq2Seq.from_pretrained("Wishwa98/ASRForCommonVoice")This model is a fine-tuned version of openai/whisper-small on the Common Accent dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer Ortho | Wer |
|---|---|---|---|---|---|
| 0.1012 | 1.14 | 500 | 0.3215 | 16.3784 | 11.5941 |
| 0.0345 | 2.28 | 1000 | 0.3483 | 16.6496 | 11.8450 |
| 0.018 | 3.42 | 1500 | 0.3829 | 17.1622 | 12.4707 |
| 0.0075 | 4.57 | 2000 | 0.4069 | 17.8667 | 13.0116 |
| 0.0059 | 5.71 | 2500 | 0.4234 | 17.9229 | 13.0605 |
Base model
openai/whisper-small