Malasar ASR Resources
Collection
5 items • Updated
How to use vrclc/Malasar_small_DTF with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="vrclc/Malasar_small_DTF") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("vrclc/Malasar_small_DTF")
model = AutoModelForSpeechSeq2Seq.from_pretrained("vrclc/Malasar_small_DTF")# Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("vrclc/Malasar_small_DTF")
model = AutoModelForSpeechSeq2Seq.from_pretrained("vrclc/Malasar_small_DTF")This model is a fine-tuned version of openai/whisper-small on the Spoken Bible Corpus: Malasar dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 0.1612 | 11.3636 | 250 | 0.3185 | 64.0206 |
| 0.0114 | 22.7273 | 500 | 0.4682 | 65.7339 |
| 0.0016 | 34.0909 | 750 | 0.5380 | 59.9657 |
| 0.0004 | 45.4545 | 1000 | 0.5761 | 59.8515 |
| 0.0003 | 56.8182 | 1250 | 0.5969 | 59.6802 |
| 0.0002 | 68.1818 | 1500 | 0.6104 | 60.3655 |
| 0.0002 | 79.5455 | 1750 | 0.6181 | 60.1942 |
| 0.0002 | 90.9091 | 2000 | 0.6211 | 60.1371 |
Base model
openai/whisper-small
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="vrclc/Malasar_small_DTF")