Hanbaike/kyrgyz_spm_tokenizer
Tags: Kyrgyz, tokenization, sentencepiece, BPE, Unigram
License: mit
Branch: main
Path: kyrgyz_spm_tokenizer/text (442 MB)
1 contributor. History: 1 commit.
Latest commit: Hanbaike, "Upload folder using huggingface_hub" (059fc10, verified), about 1 year ago
File                                     Scan  Size     Storage  Last commit                          Updated
kir_community_2017-sentences.txt         Safe  55.9 MB  xet      Upload folder using huggingface_hub  about 1 year ago
kir_newscrawl_2011_300K-sentences.txt    Safe  59 MB    xet      Upload folder using huggingface_hub  about 1 year ago
kir_newscrawl_2016_1M-sentences.txt      Safe  211 MB   xet      Upload folder using huggingface_hub  about 1 year ago
kir_wikipedia_2010_10K-sentences.txt     Safe  1.95 MB  -        Upload folder using huggingface_hub  about 1 year ago
kir_wikipedia_2016_300K-sentences.txt    Safe  57.9 MB  xet      Upload folder using huggingface_hub  about 1 year ago
kir_wikipedia_2021_300K-sentences.txt    Safe  56.3 MB  xet      Upload folder using huggingface_hub  about 1 year ago
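
The six files in this folder could be fetched programmatically. The following is a minimal sketch, not the author's own workflow: it assumes the repository is a model repo (the default `repo_type` in `huggingface_hub`) and that the files sit in a `text` subfolder, as the path in this listing suggests. The names `CORPUS_FILES`, `total_size_mb`, and `download_corpus`, and the `kyrgyz_corpus` target directory, are hypothetical and introduced only for illustration. The sizes are copied from the listing; they sum to roughly 442 MB, which agrees with the folder size shown on the page.

```python
# Sketch only: repo type, subfolder layout, and helper names are assumptions.
REPO_ID = "Hanbaike/kyrgyz_spm_tokenizer"  # repository name from the page
SUBFOLDER = "text"                          # folder shown in the listing

# File names and sizes in MB, copied from the listing.
CORPUS_FILES = {
    "kir_community_2017-sentences.txt": 55.9,
    "kir_newscrawl_2011_300K-sentences.txt": 59.0,
    "kir_newscrawl_2016_1M-sentences.txt": 211.0,
    "kir_wikipedia_2010_10K-sentences.txt": 1.95,
    "kir_wikipedia_2016_300K-sentences.txt": 57.9,
    "kir_wikipedia_2021_300K-sentences.txt": 56.3,
}


def total_size_mb(files=CORPUS_FILES):
    """Sum the listed file sizes, in MB."""
    return sum(files.values())


def download_corpus(local_dir="kyrgyz_corpus"):
    """Download every listed file and return the local paths.

    Requires network access and the huggingface_hub package.
    """
    # Imported lazily so the size check works without the package installed.
    from huggingface_hub import hf_hub_download

    paths = []
    for name in CORPUS_FILES:
        path = hf_hub_download(
            repo_id=REPO_ID,
            filename=name,
            subfolder=SUBFOLDER,  # assumed location of the files
            local_dir=local_dir,
        )
        paths.append(path)
    return paths
```

Calling `download_corpus()` would transfer about 442 MB, so checking `total_size_mb()` first is a cheap way to confirm the expected volume before starting.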