Model Card for Model ID

Model Details

Model Description

  • Developed by: Eunjung Yeo
  • Model type: phone recognizer
  • Language(s) (SLP): English
  • Finetuned from model: XLS-R-300m

Direct Use

  • Phone recognition

Downstream Use [optional]

  • Analysis of phonetic transcriptions
  • L2 Pronunciation Assessment (Mispronunciation Detection and Diagnosis)
  • Mispronunciation Assessment for pathological speech

How to Get Started with the Model

from transformers import AutoProcessor, AutoModelForCTC

processor = AutoProcessor.from_pretrained("speech31/XLS-R-english-phoneme") model = AutoModelForCTC.from_pretrained("speech31/XLS-R-english-phoneme")

Training Details

Training Data

This model is fine-tuned on the TIMIT dataset. (Can be downloaded from https://catalog.ldc.upenn.edu/LDC93s1)

Preprocessing

The dataset was preprocessed using Epitran for transliterating text into IPA.

Downloads last month
9
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support