Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
DiscreteSpeech
/
DSTK
like
8
Follow
Discrete Speech Project
6
English
Chinese
speech
tokenization
detokenization
text2token
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
DSTK
/
thirdparty
/
G2P
/
text
34.5 MB
1 contributor
History:
1 commit
gooorillax
first push of codes and models for g2p, t2u, tokenizer and detokenizer
cd8454d
4 months ago
g2pw
first push of codes and models for g2p, t2u, tokenizer and detokenizer
4 months ago
ja_userdic
first push of codes and models for g2p, t2u, tokenizer and detokenizer
4 months ago
zh_normalization
first push of codes and models for g2p, t2u, tokenizer and detokenizer
4 months ago
.gitignore
27 Bytes
first push of codes and models for g2p, t2u, tokenizer and detokenizer
4 months ago
__init__.py
886 Bytes
first push of codes and models for g2p, t2u, tokenizer and detokenizer
4 months ago
cantonese.py
5.28 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
4 months ago
chinese.py
6.46 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
4 months ago
chinese2.py
10.5 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
4 months ago
cleaner.py
3.29 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
4 months ago
cmudict-fast.rep
3.61 MB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
4 months ago
cmudict.rep
3.73 MB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
4 months ago
engdict-hot.rep
75 Bytes
first push of codes and models for g2p, t2u, tokenizer and detokenizer
4 months ago
engdict_cache.pickle
5.97 MB
xet
first push of codes and models for g2p, t2u, tokenizer and detokenizer
4 months ago
english.py
10.8 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
4 months ago
japanese.py
7.62 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
4 months ago
korean.py
7.99 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
4 months ago
namedict_cache.pickle
761 kB
xet
first push of codes and models for g2p, t2u, tokenizer and detokenizer
4 months ago
opencpop-strict.txt
4.08 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
4 months ago
symbols.py
4.56 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
4 months ago
symbols2.py
8.31 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
4 months ago
tone_sandhi.py
24.5 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
4 months ago