Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ByteSpanTokenisers
/
tokenizers
like
0
Follow
ByteSpan Tokenisers
4
Model card
Files
Files and versions
xet
Community
2e4b4eb
tokenizers
/
frequencymulti_64000
8.9 MB
3 contributors
History:
2 commits
Zeb
Remove ngram tokenizers
7487a89
8 months ago
merges.txt
Safe
679 kB
Fix multilingual frequency
8 months ago
merges_data.csv
Safe
1.58 MB
Fix multilingual frequency
8 months ago
special_tokens_map.json
Safe
65 Bytes
Remove ngram tokenizers
8 months ago
tokenizer.json
Safe
4.73 MB
Fix multilingual frequency
8 months ago
tokenizer_config.json
Safe
685 Bytes
Remove ngram tokenizers
8 months ago
vocab.json
Safe
1.91 MB
Fix multilingual frequency
8 months ago