mozilla-foundation/common_voice_17_0
Updated • 5.54k • 16
How to use internalhell/whisper_small_ru_model_trainer_2 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="internalhell/whisper_small_ru_model_trainer_2") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("internalhell/whisper_small_ru_model_trainer_2")
model = AutoModelForSpeechSeq2Seq.from_pretrained("internalhell/whisper_small_ru_model_trainer_2")# Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("internalhell/whisper_small_ru_model_trainer_2")
model = AutoModelForSpeechSeq2Seq.from_pretrained("internalhell/whisper_small_ru_model_trainer_2")This model is a fine-tuned version of openai/whisper-small on the Common Voice 17.0 dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer | Cer | Ser |
|---|---|---|---|---|---|---|
| 0.2176 | 0.1516 | 500 | 0.2575 | 21.0009 | 5.4512 | 69.0581 |
| 0.2146 | 0.3032 | 1000 | 0.2395 | 19.7826 | 5.2221 | 66.5785 |
| 0.1817 | 0.4548 | 1500 | 0.2264 | 18.5724 | 4.7800 | 64.4320 |
| 0.1862 | 0.6064 | 2000 | 0.2140 | 18.2088 | 4.7904 | 62.3542 |
| 0.1618 | 0.7580 | 2500 | 0.2049 | 17.0765 | 4.3953 | 60.4234 |
| 0.1597 | 0.9096 | 3000 | 0.1989 | 16.6769 | 4.3640 | 59.1591 |
Base model
openai/whisper-small
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="internalhell/whisper_small_ru_model_trainer_2")