PolyAI/minds14
Viewer • Updated • 16.3k • 6.08k • 103
How to use Imxxn/AudioCourseU5-ASR with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="Imxxn/AudioCourseU5-ASR") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("Imxxn/AudioCourseU5-ASR")
model = AutoModelForSpeechSeq2Seq.from_pretrained("Imxxn/AudioCourseU5-ASR")# Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("Imxxn/AudioCourseU5-ASR")
model = AutoModelForSpeechSeq2Seq.from_pretrained("Imxxn/AudioCourseU5-ASR")This model is a fine-tuned version of openai/whisper-tiny on the None dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer Ortho | Wer |
|---|---|---|---|---|---|
| 0.3065 | 3.57 | 100 | 0.4921 | 36.8908 | 0.3577 |
| 0.0391 | 7.14 | 200 | 0.5425 | 35.3486 | 0.3436 |
| 0.0042 | 10.71 | 300 | 0.5878 | 35.6570 | 0.3495 |
| 0.0012 | 14.29 | 400 | 0.6206 | 34.2998 | 0.3377 |
| 0.0007 | 17.86 | 500 | 0.6438 | 34.4849 | 0.3406 |
Base model
openai/whisper-tiny
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="Imxxn/AudioCourseU5-ASR")