espnet/powsm
Automatic Speech Recognition
ESPnet
4 datasets
multilingual
audio
phone-recognition
grapheme-to-phoneme
phoneme-to-grapheme
arxiv: 2510.24992
arxiv: 2601.14046
License: cc-by-4.0
d358132
powsm
2.75 GB
1 contributor
History: 16 commits
Latest commit: cjli, "Upload textnorm_retrained/exp/s2t_train_ctc3_conv2d_size768_e9_d9_mel128_raw_bpe40000/config.yaml with huggingface_hub" (d358132, verified, 3 months ago)
Name | Security scan | Size | Last commit message | Last updated
data | - | - | add model files | 6 months ago
exp | - | - | add train/feats_stats.npz | 6 months ago
textnorm_retrained | - | - | Upload textnorm_retrained/exp/s2t_train_ctc3_conv2d_size768_e9_d9_mel128_raw_bpe40000/config.yaml with huggingface_hub | 3 months ago
.gitattributes | Safe | 1.52 kB | initial commit | 6 months ago
README.md | Safe | 3.45 kB | update arxiv | 6 months ago
meta.yaml | Safe | 341 Bytes | patch yaml file | 6 months ago