Model Card for speech31/XLS-R-english-phoneme
Model Details
Model Description
- Developed by: Eunjung Yeo
- Model type: phone recognizer
- Language(s) (SLP): English
- Finetuned from model: XLS-R-300m
Direct Use
- Phone recognition
Downstream Use
- Analysis of phonetic transcriptions
- L2 Pronunciation Assessment (Mispronunciation Detection and Diagnosis)
- Mispronunciation Assessment for pathological speech
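For the downstream uses listed above, a common first step is to align the recognized phone sequence against a canonical (expected) one and count the differences. The following is a minimal sketch of that idea using a plain Levenshtein alignment; it is not the author's assessment pipeline. The function names `align_phones` and `phone_error_rate` and the example phone sequences are hypothetical and chosen only for illustration.

```python
def align_phones(canonical, recognized):
    """Align two phone sequences with unit-cost Levenshtein distance.

    Returns a list of (operation, canonical_phone, recognized_phone) tuples,
    where operation is "match", "substitution", "deletion", or "insertion".
    Illustrative helper, not part of the model or of transformers.
    """
    n, m = len(canonical), len(recognized)
    # cost[i][j] = edit distance between canonical[:i] and recognized[:j]
    cost = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost[i][0] = i
    for j in range(1, m + 1):
        cost[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diag = cost[i - 1][j - 1] + (canonical[i - 1] != recognized[j - 1])
            cost[i][j] = min(diag, cost[i - 1][j] + 1, cost[i][j - 1] + 1)

    # Trace back from the bottom-right corner to recover the operations.
    ops = []
    i, j = n, m
    while i > 0 or j > 0:
        if (i > 0 and j > 0 and
                cost[i][j] == cost[i - 1][j - 1] + (canonical[i - 1] != recognized[j - 1])):
            same = canonical[i - 1] == recognized[j - 1]
            ops.append(("match" if same else "substitution",
                        canonical[i - 1], recognized[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and cost[i][j] == cost[i - 1][j] + 1:
            ops.append(("deletion", canonical[i - 1], None))
            i -= 1
        else:
            ops.append(("insertion", None, recognized[j - 1]))
            j -= 1
    ops.reverse()
    return ops


def phone_error_rate(canonical, recognized):
    """Substitutions + deletions + insertions, divided by the canonical length."""
    if not canonical:
        raise ValueError("canonical sequence must not be empty")
    ops = align_phones(canonical, recognized)
    errors = sum(1 for op, _, _ in ops if op != "match")
    return errors / len(canonical)


# Hypothetical example: canonical "think" vs. a recognized /s/ for /θ/.
canonical = ["θ", "ɪ", "ŋ", "k"]
recognized = ["s", "ɪ", "ŋ", "k"]
for op, expected, produced in align_phones(canonical, recognized):
    print(op, expected, produced)
print(phone_error_rate(canonical, recognized))
```

The non-matching operations indicate where a speaker's production diverges from the expected phones, which is the kind of information mispronunciation detection and diagnosis builds on. Real systems typically add phone-specific costs or confidence scores on top of such an alignment.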
How to Get Started with the Model
from transformers import AutoProcessor, AutoModelForCTC

# Load the processor (feature extractor + tokenizer) and the CTC model
processor = AutoProcessor.from_pretrained("speech31/XLS-R-english-phoneme")
model = AutoModelForCTC.from_pretrained("speech31/XLS-R-english-phoneme")
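Once the processor and model are loaded, recognition amounts to a forward pass followed by greedy CTC decoding. The sketch below spells out the decoding step so that the output is a list of phone tokens rather than a joined string. It rests on several assumptions that are not stated in this card: the input is a mono, 1-D float waveform sampled at 16 kHz, the processor behaves like a standard Wav2Vec2 processor, and the tokenizer's padding token serves as the CTC blank. The names `ctc_greedy_collapse` and `recognize_phones` are hypothetical.

```python
def ctc_greedy_collapse(frame_ids, blank_id):
    """Greedy CTC decoding: merge repeated frame labels, then drop blanks."""
    collapsed = []
    previous = None
    for label in frame_ids:
        if label != previous and label != blank_id:
            collapsed.append(label)
        previous = label
    return collapsed


def recognize_phones(waveform, sampling_rate=16000):
    """Return the recognized phone tokens for one utterance.

    `waveform` is assumed to be a mono 1-D float array at `sampling_rate` Hz.
    The model is loaded on every call for brevity; load it once in real use.
    """
    import torch
    from transformers import AutoProcessor, AutoModelForCTC

    processor = AutoProcessor.from_pretrained("speech31/XLS-R-english-phoneme")
    model = AutoModelForCTC.from_pretrained("speech31/XLS-R-english-phoneme")
    model.eval()

    inputs = processor(waveform, sampling_rate=sampling_rate, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits  # shape: (batch, frames, vocab)

    # Most likely label per frame for the single utterance in the batch.
    frame_ids = logits.argmax(dim=-1)[0].tolist()

    # Assumption: the padding token is the CTC blank, as in Wav2Vec2 CTC models.
    blank_id = processor.tokenizer.pad_token_id
    phone_ids = ctc_greedy_collapse(frame_ids, blank_id)
    return processor.tokenizer.convert_ids_to_tokens(phone_ids)
```

Calling `recognize_phones(waveform)` with audio loaded by a library of your choice should yield one token per recognized phone. If joined text is sufficient, `processor.batch_decode` on the per-frame argmax is the more conventional route.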
Training Details
Training Data
The model was fine-tuned on the TIMIT dataset, which is available from the Linguistic Data Consortium (catalog number LDC93S1): https://catalog.ldc.upenn.edu/LDC93s1
Preprocessing
The transcript text was transliterated into IPA using Epitran.