Whisper
Collection
Whisper model fine-tuned on different languages • 10 items • Updated
How to use ALM/whisper-it-medium-augmented with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="ALM/whisper-it-medium-augmented") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("ALM/whisper-it-medium-augmented")
model = AutoModelForSpeechSeq2Seq.from_pretrained("ALM/whisper-it-medium-augmented")# Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("ALM/whisper-it-medium-augmented")
model = AutoModelForSpeechSeq2Seq.from_pretrained("ALM/whisper-it-medium-augmented")This model is a fine-tuned version of openai/whisper-medium on the mozilla-foundation/common_voice_11_0 it dataset. It achieves the following results on the evaluation set:
IMPORTANT The model has been trained using data augmentation to improve its generalization capabilities and robustness. The results on the eval set during training are biased towards data augmentation applied to evaluation data.
Results on eval set
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 0.226 | 0.33 | 2500 | 0.2779 | 14.6642 |
| 0.1278 | 1.03 | 5000 | 0.1818 | 10.2049 |
| 0.0304 | 1.36 | 7500 | 0.1388 | 7.5544 |
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="ALM/whisper-it-medium-augmented")