ByteSpanTokenisers/tokenizers
cab11c3
tokenizers/frequencymulti_64000
8.9 MB, 3 contributors, History: 2 commits
Latest commit: Zeb, "Remove ngram tokenizers" (7487a89), 9 months ago
File | Size | Last commit | Updated
merges.txt | 679 kB | Fix multilingual frequency | 9 months ago
merges_data.csv | 1.58 MB | Fix multilingual frequency | 9 months ago
special_tokens_map.json | 65 Bytes | Remove ngram tokenizers | 9 months ago
tokenizer.json | 4.73 MB | Fix multilingual frequency | 9 months ago
tokenizer_config.json | 685 Bytes | Remove ngram tokenizers | 9 months ago
vocab.json | 1.91 MB | Fix multilingual frequency | 9 months ago