DiscreteSpeech/DSTK
Discrete Speech Project
Tags: English, Chinese, speech, tokenization, detokenization, text2token
License: apache-2.0
DSTK/thirdparty/G2P (branch: main)
37.1 MB, 1 contributor, History: 3 commits
Latest commit: bdecca1 by gooorillax, 4 months ago: "refine readme, add logo, and fix a punct normalization problem in tn"
Name | Size | Last commit message | Updated
pypinyin | - | first push of codes and models for g2p, t2u, tokenizer and detokenizer | 4 months ago
text | - | first push of codes and models for g2p, t2u, tokenizer and detokenizer | 4 months ago
whitelist | - | add example codes in README, refine README, add DSTK to whitelist | 4 months ago
G2P_processors.py | 9.09 kB | first push of codes and models for g2p, t2u, tokenizer and detokenizer | 4 months ago
README.md | 192 Bytes | first push of codes and models for g2p, t2u, tokenizer and detokenizer | 4 months ago
TN_processors.py | 7.38 kB | refine readme, add logo, and fix a punct normalization problem in tn | 4 months ago
__init__.py | 0 Bytes | first push of codes and models for g2p, t2u, tokenizer and detokenizer | 4 months ago
patch_for_deps.sh | 361 Bytes | add example codes in README, refine README, add DSTK to whitelist | 4 months ago
requirements.txt | 126 Bytes | first push of codes and models for g2p, t2u, tokenizer and detokenizer | 4 months ago