voidful/NMSQA
Viewer • Updated • 98.3k • 370 • 7
How to use menevsem/hubert-base-ls960-nmsqa-asr with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="menevsem/hubert-base-ls960-nmsqa-asr") # Load model directly
from transformers import AutoProcessor, AutoModel
processor = AutoProcessor.from_pretrained("menevsem/hubert-base-ls960-nmsqa-asr")
model = AutoModel.from_pretrained("menevsem/hubert-base-ls960-nmsqa-asr")This model was pretrained using facebook/hubert-base-ls960 model on NMSQA dataset. The task is Automatic Speech Recognition (ASR) in which the questions and context sentences are used.
This is a checkpoint with WER 14.36 on dev set.
The input of the models are from NMSQA dataset. The task of the dataset is Spoken QA, but in this model I used the sentences for ASR. The input audios are both from context and questions. This ASR model was trained on using training and dev set of NMSQA.
The model should be used as fine-tuned model for wav2vec2.
from transformers import AutoModel
model = AutoModel.from_pretrained("menevsem/hubert-base-ls960-nmsqa-asr")
The model was trained using voidful/NMSQA train and dev set.
For evalaution WER metric is used on dev set.