mozilla-foundation/common_voice_17_0
Updated • 5.89k • 19
How to use SmartWhatt/rangsitra-whisper-md-cv with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="SmartWhatt/rangsitra-whisper-md-cv") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("SmartWhatt/rangsitra-whisper-md-cv")
model = AutoModelForSpeechSeq2Seq.from_pretrained("SmartWhatt/rangsitra-whisper-md-cv")This model is a fine-tuned version of openai/whisper-medium on the Common Voice 17.0 dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Cer | Wer | Validation Loss |
|---|---|---|---|---|---|
| 0.1709 | 1.0 | 2052 | 6.2976 | 22.0505 | 0.1529 |
| 0.1076 | 2.0 | 4104 | 5.5274 | 18.8083 | 0.1377 |
| 0.0509 | 3.0 | 6156 | 4.7389 | 16.4986 | 0.1399 |
| 0.0223 | 4.0 | 8208 | 4.3180 | 14.9002 | 0.1409 |
| 0.004 | 5.0 | 10260 | 4.0149 | 14.0500 | 0.1453 |
Base model
openai/whisper-medium