iFaz/Whisper_Compatible_SER_benchmark
Viewer • Updated • 31.4k • 22
How to use iFaz/whisper-SER-base-v1 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="iFaz/whisper-SER-base-v1") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("iFaz/whisper-SER-base-v1")
model = AutoModelForSpeechSeq2Seq.from_pretrained("iFaz/whisper-SER-base-v1")This model is a fine-tuned version of openai/whisper-base on the Whisper_Compatible_SER_benchmark(Not train_augmented) dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 0.1761 | 2.4450 | 1000 | 0.5625 | 48.9594 |
| 0.0796 | 4.8900 | 2000 | 0.5905 | 87.2151 |
| 0.0201 | 7.3350 | 3000 | 0.7191 | 125.5203 |
| 0.0054 | 9.7800 | 4000 | 0.7985 | 127.7998 |
| 0.0012 | 12.2249 | 5000 | 0.8611 | 108.0278 |
| 0.0008 | 14.6699 | 6000 | 0.8757 | 105.4509 |
Base model
openai/whisper-base