iFaz/facebook_voxpopulik_16k_Whisper_Compatible
Viewer • Updated • 10.3k • 4
How to use iFaz/whisper-SER-base-v2 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="iFaz/whisper-SER-base-v2") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("iFaz/whisper-SER-base-v2")
model = AutoModelForSpeechSeq2Seq.from_pretrained("iFaz/whisper-SER-base-v2")This model is a fine-tuned version of openai/whisper-base on the facebook_voxpopulik_16k_Whisper_Compatible dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 0.4753 | 0.4322 | 1000 | 0.4532 | 24.8077 |
| 0.4303 | 0.8643 | 2000 | 0.4212 | 25.1645 |
| 0.2697 | 1.2965 | 3000 | 0.4265 | 27.7174 |
| 0.2267 | 1.7286 | 4000 | 0.4122 | 27.1307 |
| 0.1764 | 2.1608 | 5000 | 0.4505 | 39.1422 |
| 0.2175 | 2.5929 | 6000 | 0.4206 | 26.8770 |
| 0.0845 | 3.0251 | 7000 | 0.4547 | 32.9739 |
| 0.0907 | 3.4572 | 8000 | 0.4707 | 28.8353 |
| 0.0968 | 3.8894 | 9000 | 0.4768 | 32.9660 |
| 0.0495 | 4.3215 | 10000 | 0.5026 | 31.2455 |
| 0.051 | 4.7537 | 11000 | 0.5037 | 32.8312 |
| 0.0668 | 5.1858 | 12000 | 0.5113 | 31.9908 |
Base model
openai/whisper-base