Update README.md
Browse files
README.md
CHANGED
|
@@ -20,7 +20,16 @@ tags:
|
|
| 20 |
pipeline_tag: automatic-speech-recognition
|
| 21 |
---
|
| 22 |
|
| 23 |
-
🐁POWSM-CTC
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 24 |
Its multi-task encoder-CTC structure is based on [OWSM-CTC](https://aclanthology.org/2024.acl-long.549/), and trained on [IPAPack++](https://huggingface.co/anyspeech), the same dataset as POWSM.
|
| 25 |
|
| 26 |
POWSM-CTC is proposed together with our paper [PRiSM](https://arxiv.org/abs/2601.14046), the first open-source benchmark for phone recognition systems.
|
|
|
|
| 20 |
pipeline_tag: automatic-speech-recognition
|
| 21 |
---
|
| 22 |
|
| 23 |
+
### 🐁POWSM-CTC
|
| 24 |
+
|
| 25 |
+
<p align="left">
|
| 26 |
+
<a href="https://arxiv.org/abs/2601.14046"><img src="https://img.shields.io/badge/Paper-2601.14046-red.svg?logo=arxiv&logoColor=red"/></a>
|
| 27 |
+
<a href="https://huggingface.co/espnet/powsm_ctc"><img src="https://img.shields.io/badge/Model-powsm_ctc-yellow.svg?logo=huggingface&logoColor=yellow"/></a>
|
| 28 |
+
<a href="https://github.com/changelinglab/prism"><img src="https://img.shields.io/badge/Benchmark-PRiSM-green.svg?logo=github&logoColor=black"/></a>
|
| 29 |
+
<a href="https://github.com/espnet/egs2/powsm_ctc/s2t1"><img src="https://img.shields.io/badge/Recipe-powsm_ctc-blue.svg?logo=github&logoColor=black"/></a>
|
| 30 |
+
</p>
|
| 31 |
+
|
| 32 |
+
POWSM-CTC is a variant of [POWSM](https://huggingface.co/espnet/powsm), the first phonetic foundation model that can perform four phone-related tasks.
|
| 33 |
Its multi-task encoder-CTC structure is based on [OWSM-CTC](https://aclanthology.org/2024.acl-long.549/), and trained on [IPAPack++](https://huggingface.co/anyspeech), the same dataset as POWSM.
|
| 34 |
|
| 35 |
POWSM-CTC is proposed together with our paper [PRiSM](https://arxiv.org/abs/2601.14046), the first open-source benchmark for phone recognition systems.
|