speech31
/

XLS-R-english-phoneme

Automatic Speech Recognition

Model card Files Files and versions

Model Card for Model ID

Model Details

Model Description

Developed by: Eunjung Yeo
Model type: phone recognizer
Language(s) (SLP): English
Finetuned from model: XLS-R-300m

Direct Use

Phone recognition

Downstream Use [optional]

Analysis of phonetic transcriptions
L2 Pronunciation Assessment (Mispronunciation Detection and Diagnosis)
Mispronunciation Assessment for pathological speech

How to Get Started with the Model

from transformers import AutoProcessor, AutoModelForCTC

processor = AutoProcessor.from_pretrained("speech31/XLS-R-english-phoneme") model = AutoModelForCTC.from_pretrained("speech31/XLS-R-english-phoneme")

Training Details

Training Data

This model is fine-tuned on the TIMIT dataset. (Can be downloaded from https://catalog.ldc.upenn.edu/LDC93s1)

Preprocessing

The dataset was preprocessed using Epitran for transliterating text into IPA.

Downloads last month: 98