Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
DiscreteSpeech
/
DSTK
like
8
Follow
Discrete Speech Project
6
English
Chinese
speech
tokenization
detokenization
text2token
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
DSTK
/
thirdparty
/
G2P
/
text
/
zh_normalization
95.1 kB
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
gooorillax
first push of codes and models for g2p, t2u, tokenizer and detokenizer
cd8454d
9 months ago
README.md
Safe
1.39 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
9 months ago
__init__.py
664 Bytes
first push of codes and models for g2p, t2u, tokenizer and detokenizer
9 months ago
char_convert.py
Safe
66.1 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
9 months ago
chronology.py
Safe
3.62 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
9 months ago
constants.py
Safe
2.46 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
9 months ago
num.py
9.42 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
9 months ago
phonecode.py
Safe
1.99 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
9 months ago
quantifier.py
Safe
1.8 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
9 months ago
text_normlization.py
7.6 kB
first push of codes and models for g2p, t2u, tokenizer and detokenizer
9 months ago