mozilla-foundation/common_voice_13_0
Updated • 2.36k • 3
How to use seiching/whisper-large-seiching with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="seiching/whisper-large-seiching") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("seiching/whisper-large-seiching")
model = AutoModelForSpeechSeq2Seq.from_pretrained("seiching/whisper-large-seiching")This model is a fine-tuned version of openai/whisper-large on the Common Voice 13 dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer Ortho | Wer |
|---|---|---|---|---|---|
| 0.0361 | 0.69 | 500 | 0.1989 | 38.3627 | 37.9517 |
| 0.0105 | 1.38 | 1000 | 0.2217 | 39.0259 | 38.9100 |
| 0.0208 | 2.06 | 1500 | 0.2299 | 39.6891 | 39.3292 |
| 0.0091 | 2.75 | 2000 | 0.2264 | 39.8964 | 39.4091 |
| 0.0153 | 3.44 | 2500 | 0.2363 | 39.8135 | 39.3891 |
| 0.0191 | 4.13 | 3000 | 0.2415 | 40.1865 | 40.0080 |
| 0.0061 | 4.81 | 3500 | 0.2542 | 41.1813 | 39.9281 |
| 0.0107 | 5.5 | 4000 | 0.2457 | 40.3316 | 39.9281 |