iFaz/common_voice_17_0_emotion_5k
How to use iFaz/whisper-base-en-emo-v1 with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="iFaz/whisper-base-en-emo-v1")

# Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("iFaz/whisper-base-en-emo-v1")
model = AutoModelForSpeechSeq2Seq.from_pretrained("iFaz/whisper-base-en-emo-v1")

This model is a fine-tuned version of openai/whisper-base on the Common Voice 17.0 (Emotional Tag) dataset. It achieves the following results on the evaluation set:
Training results:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 0.0074 | 40.0 | 1000 | 0.8128 | 130.1508 |
| 0.0002 | 80.0 | 2000 | 0.9065 | 114.5729 |
| 0.0001 | 120.0 | 3000 | 0.9507 | 109.0452 |
| 0.0001 | 160.0 | 4000 | 0.9660 | 120.1005 |
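
Every Wer value in the table is above 100, which can look impossible at first glance. The sketch below assumes the column is the standard word error rate expressed as a percentage, i.e. 100 × (substitutions + deletions + insertions) / number of reference words; the card does not say which implementation produced these numbers, and the `word_error_rate` helper and the sample sentences are hypothetical, written only for illustration. Because insertions are counted against the reference length, a hypothesis containing many extra words can push the score past 100.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER in percent: 100 * (S + D + I) / N, with N = reference word count.

    Hypothetical helper for illustration; not the metric code used for this card.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the word-level edit distance between the reference
    # prefix processed so far and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return 100.0 * prev[-1] / len(ref)


# One substitution out of four reference words.
print(word_error_rate("turn the lights off", "turn the light off"))  # 25.0

# Three inserted words against a two-word reference: WER exceeds 100.
print(word_error_rate("hello world", "hello hello world world world"))  # 150.0
```

Under this definition a WER above 100 means the transcripts contain more word-level errors than the references contain words, so the scores in the table are worth checking against sample outputs before relying on this checkpoint.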
Base model
openai/whisper-base