wav2vec2-base-qxp-finetuned

Model Description

This model is a fine-tuned version of facebook/wav2vec2-base for Automatic Speech Recognition (ASR) in Puno Quechua (qxp).
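
A minimal inference sketch follows, under stated assumptions: the checkpoint is assumed to carry a CTC head and to ship its processor (tokenizer plus feature extractor), as is typical for wav2vec2 ASR fine-tunes, and `MODEL_ID` is a hypothetical repository id because the owning namespace is not given here. The helper names `check_sample_rate` and `transcribe` are illustrative only.

```python
TARGET_SAMPLE_RATE = 16_000  # wav2vec2-base was pretrained on 16 kHz audio

# Hypothetical repo id: the owning namespace is not stated in this card.
MODEL_ID = "your-namespace/wav2vec2-base-qxp-finetuned"


def check_sample_rate(sample_rate: int) -> int:
    """Reject audio that has not been resampled to 16 kHz."""
    if sample_rate != TARGET_SAMPLE_RATE:
        raise ValueError(
            f"expected {TARGET_SAMPLE_RATE} Hz audio, got {sample_rate} Hz; "
            "resample before calling transcribe()"
        )
    return sample_rate


def transcribe(waveform, sample_rate: int, model_id: str = MODEL_ID) -> str:
    """Greedy CTC decoding of a 1-D mono float waveform (assumes a CTC head)."""
    check_sample_rate(sample_rate)

    # Imported lazily so the helper above works without the heavy dependencies.
    import torch
    from transformers import AutoModelForCTC, AutoProcessor

    processor = AutoProcessor.from_pretrained(model_id)
    model = AutoModelForCTC.from_pretrained(model_id)
    model.eval()

    inputs = processor(waveform, sampling_rate=sample_rate, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits  # shape: (batch, frames, vocab)

    # Take the most likely token per frame; batch_decode collapses CTC repeats.
    predicted_ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(predicted_ids)[0]
```

To use it, load a recording as a mono float array at 16 kHz with any audio library and call `transcribe(waveform, 16_000)`. Greedy decoding is chosen here for simplicity; it is not necessarily the decoding used when the model was evaluated.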

Base pretrained model: facebook/wav2vec2-base

This model was developed by:

The development and training of this model involved the following research laboratories and institutions:

The model was fine-tuned using the following dataset:

Dataset name: Mozilla Data Collective - Scripted Speech and Spontaneous Speech corpora
Language: Puno Quechua
Total duration: 65 hours
Speech type: read speech / elicited speech
Source: Mozilla Data Collective

The fine-tuning process was performed using the following hardware and software configuration:

Weights format: Safetensors
Model size: 94.4M params
Tensor type: F32
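
As a rough sanity check on the size figures above, the weight footprint can be estimated from the parameter count and tensor type. This is a sketch only: 94.4M is a rounded count, F32 is taken to mean 4 bytes per parameter, and the function name `weights_size_mb` is hypothetical.

```python
PARAM_COUNT = 94_400_000   # "94.4M params" (rounded figure from this card)
BYTES_PER_PARAM = 4        # F32 stores each parameter in 4 bytes


def weights_size_mb(param_count: int = PARAM_COUNT,
                    bytes_per_param: int = BYTES_PER_PARAM) -> float:
    """Approximate size of the raw weights in decimal megabytes."""
    return param_count * bytes_per_param / 1_000_000


print(f"{weights_size_mb():.1f} MB")  # prints "377.6 MB"
```

The estimate covers the weights alone; activations and framework overhead add to the memory needed at inference time.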
