DiscreteSpeech/DSTK
Discrete Speech Project
Tags: English, Chinese, speech, tokenization, detokenization, text2token
License: apache-2.0
Branch: main
Path: DSTK/thirdparty/G2P (37.1 MB)
1 contributor
History: 3 commits
Latest commit: bdecca1 by gooorillax, 8 months ago: refine readme, add logo, and fix a punct normalization problem in tn
Name | Size | Last commit message | Updated
pypinyin | - | first push of codes and models for g2p, t2u, tokenizer and detokenizer | 8 months ago
text | - | first push of codes and models for g2p, t2u, tokenizer and detokenizer | 8 months ago
whitelist | - | add example codes in README, refine README, add DSTK to whitelist | 8 months ago
G2P_processors.py | 9.09 kB | first push of codes and models for g2p, t2u, tokenizer and detokenizer | 8 months ago
README.md | 192 Bytes | first push of codes and models for g2p, t2u, tokenizer and detokenizer | 8 months ago
TN_processors.py | 7.38 kB | refine readme, add logo, and fix a punct normalization problem in tn | 8 months ago
__init__.py | 0 Bytes | first push of codes and models for g2p, t2u, tokenizer and detokenizer | 8 months ago
patch_for_deps.sh | 361 Bytes | add example codes in README, refine README, add DSTK to whitelist | 8 months ago
requirements.txt | 126 Bytes | first push of codes and models for g2p, t2u, tokenizer and detokenizer | 8 months ago