eljanmahammadli/turkic-tokenizers
Tags: 5 languages, sentencepiece, tokenizers, bpe, turkic, turkish, azerbaijani, kazakh, uzbek, kyrgyz, multilingual
License: apache-2.0
Branch: main
Path: turkic-tokenizers/tokenizers/shared
Size: 4.91 MB
1 contributor
History: 1 commit
Latest commit: 00c3a73 (verified) by eljanmahammadli, 5 days ago
Commit message: Initial release: 24 SentencePiece tokenizers for Turkic languages
Every file below was last changed 5 days ago by the commit "Initial release: 24 SentencePiece tokenizers for Turkic languages".

File                  Size      Storage
spm_16000.meta.json   1.03 kB   -
spm_16000.model       484 kB    xet
spm_16000.vocab       217 kB    -
spm_32000.meta.json   1.03 kB   -
spm_32000.model       778 kB    xet
spm_32000.vocab       495 kB    -
spm_64000.meta.json   1.03 kB   -
spm_64000.model       1.4 MB    xet
spm_64000.vocab       1.09 MB   -
spm_8000.meta.json    1.03 kB   -
spm_8000.model        348 kB    xet
spm_8000.vocab        90 kB     -
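
The listing above follows a regular naming scheme: for each vocabulary size there is a `.model`, a `.vocab`, and a `.meta.json` file under `tokenizers/shared`. As a minimal sketch, the Python below builds those paths and, optionally, downloads one model and loads it. The helper names `shared_files` and `load_shared_tokenizer` are hypothetical, and the sketch assumes the `huggingface_hub` and `sentencepiece` packages are installed and that the repository is a public model repository; none of this is taken from the repository's own documentation.

```python
from pathlib import PurePosixPath

# Repository id and directory as they appear in the listing.
REPO_ID = "eljanmahammadli/turkic-tokenizers"
SHARED_DIR = PurePosixPath("tokenizers/shared")

# Vocabulary sizes that have files in the shared directory.
VOCAB_SIZES = (8000, 16000, 32000, 64000)


def shared_files(vocab_size: int) -> dict[str, str]:
    """Return the in-repo paths of the three files for one vocabulary size.

    Hypothetical helper: it only mirrors the naming pattern visible in
    the file listing (spm_<size>.model / .vocab / .meta.json).
    """
    if vocab_size not in VOCAB_SIZES:
        raise ValueError(f"no shared tokenizer listed for vocab size {vocab_size}")
    stem = f"spm_{vocab_size}"
    return {
        "model": str(SHARED_DIR / f"{stem}.model"),
        "vocab": str(SHARED_DIR / f"{stem}.vocab"),
        "meta": str(SHARED_DIR / f"{stem}.meta.json"),
    }


def load_shared_tokenizer(vocab_size: int):
    """Download one .model file from the Hub and load it with SentencePiece.

    Assumption: the repo is reachable as a model repo under REPO_ID.
    Imports are local so that the path helper works without these packages.
    """
    from huggingface_hub import hf_hub_download
    import sentencepiece as spm

    model_path = hf_hub_download(
        repo_id=REPO_ID,
        filename=shared_files(vocab_size)["model"],
    )
    return spm.SentencePieceProcessor(model_file=model_path)
```

Calling `load_shared_tokenizer(32000)` and then `encode(text, out_type=str)` on the returned processor should yield subword pieces for the input text. The download step needs network access, so it is kept separate from the pure path helper.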