File size: 846 Bytes
ae83fd1 99e8549 c500113 8460602 6818f3c c500113 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 |
---
license: apache-2.0
language:
- en
base_model:
- facebook/wav2vec2-large-xlsr-53
tags:
- phone
- speech
- recognition
- british
---
# GBPhone: British English Phone Recognizer
GBPhone is a phone recognizer trained for British English and producing [SAMPA](https://en.wikipedia.org/wiki/SAMPA_chart_for_English) phone symbols.
GBPhone was fine tuned from the [wav2vec2 XLSR](https://huggingface.co/docs/transformers/en/model_doc/xlsr_wav2vec2) model using a British English dataset.
An example Python script is included. Output is a CSV file with log likelihoods per phone per frame.
Because the model is trained by CTC, each phone is marked only at the start of each segment, and the blank symbol (blk) is used to pad the rest of the segment.
An example R script is included to display the recognition results.
Mark Huckvale
March 2025 |