DiscreteSpeech/DSTK
Discrete Speech Project
Tags: English, Chinese, speech, tokenization, detokenization, text2token
License: apache-2.0
DSTK/thirdparty/G2P/text (main branch, 34.5 MB)
1 contributor, History: 1 commit
gooorillax: first push of codes and models for g2p, t2u, tokenizer and detokenizer (cd8454d, 9 months ago)
g2pw/
ja_userdic/
zh_normalization/
.gitignore (27 Bytes)
__init__.py (886 Bytes)
cantonese.py (5.28 kB)
chinese.py (6.46 kB)
chinese2.py (10.5 kB)
cleaner.py (3.29 kB)
cmudict-fast.rep (3.61 MB, scan: Safe)
cmudict.rep (3.73 MB, scan: Safe)
engdict-hot.rep (75 Bytes, scan: Safe)
engdict_cache.pickle (5.97 MB, xet, pickle scan: Suspicious, no problematic imports detected)
english.py (10.8 kB)
japanese.py (7.62 kB)
korean.py (7.99 kB)
namedict_cache.pickle (761 kB, xet, pickle scan: Safe, no problematic imports detected)
opencpop-strict.txt (4.08 kB, scan: Safe)
symbols.py (4.56 kB)
symbols2.py (8.31 kB)
tone_sandhi.py (24.5 kB)