google/fleurs
Viewer • Updated • 768k • 60.2k • 406
How to use Scrya/whisper-medium-vi-augmented with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="Scrya/whisper-medium-vi-augmented") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("Scrya/whisper-medium-vi-augmented")
model = AutoModelForSpeechSeq2Seq.from_pretrained("Scrya/whisper-medium-vi-augmented")# Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("Scrya/whisper-medium-vi-augmented")
model = AutoModelForSpeechSeq2Seq.from_pretrained("Scrya/whisper-medium-vi-augmented")This model is a fine-tuned version of openai/whisper-medium on the following datasets:
It achieves the following results on the evaluation set:
More information needed
More information needed
Training:
Evaluation:
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
|---|---|---|---|---|---|
| 0.1992 | 1.8 | 1000 | 0.2726 | 17.4929 | 8.2562 |
| 0.0402 | 3.6 | 2000 | 0.3317 | 17.4929 | 8.2588 |
| 0.0073 | 5.4 | 3000 | 0.3429 | 17.6793 | 8.8913 |
| 0.0014 | 7.19 | 4000 | 0.3599 | 19.0283 | 9.5103 |
| 0.0006 | 8.99 | 5000 | 0.3696 | 16.6594 | 7.7625 |
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="Scrya/whisper-medium-vi-augmented")