Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
espnet
/
powsm
like
9
Follow
ESPnet
336
Automatic Speech Recognition
ESPnet
4 datasets
multilingual
audio
phone-recognition
grapheme-to-phoneme
phoneme-to-grapheme
arxiv:
2510.24992
arxiv:
2601.14046
License:
cc-by-4.0
Model card
Files
Files and versions
xet
Community
1
Use this model
c02412d
powsm
2.75 GB
1 contributor
History:
17 commits
cjli
Create meta.yaml
c02412d
verified
30 days ago
data
add model files
4 months ago
exp
add train/feats_stats.npz
4 months ago
textnorm_retrained
Create meta.yaml
30 days ago
.gitattributes
1.52 kB
initial commit
4 months ago
README.md
3.45 kB
update arxiv
4 months ago
meta.yaml
341 Bytes
patch yaml file
4 months ago