iFaz/Whisper_Compatible_SER_benchmark
Viewer • Updated • 31.4k • 22
How to use iFaz/whisper-base-SER-v5_2 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="iFaz/whisper-base-SER-v5_2") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("iFaz/whisper-base-SER-v5_2")
model = AutoModelForSpeechSeq2Seq.from_pretrained("iFaz/whisper-base-SER-v5_2")
)
This model is a fine-tuned version of openai/whisper-base on the Whisper_Compatible_SER_benchmark(Not train_augmented) dataset.
It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 0.0004 | 250.0 | 1000 | 0.2744 | 665.0 |
| 0.0001 | 500.0 | 2000 | 0.3142 | 413.0 |
| 0.0 | 750.0 | 3000 | 0.3356 | 239.0 |
| 0.0 | 1000.0 | 4000 | 0.3451 | 239.0 |
| 0.0 | 1250.0 | 5000 | 0.3657 | 236.0 |
| 0.0 | 1500.0 | 6000 | 0.3675 | 236.0 |
Base model
openai/whisper-base