Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
RA-ALTA
/
tokenizer-zh
like
0
Follow
RA at ALTA
4
Model card
Files
Files and versions
xet
Community
main
tokenizer-zh
1 contributor
History:
21 commits
suchirsalhan
Final Rescue: Full vocab restoration with Unigram scores and Metaspace fix
24cc4f3
verified
5 days ago
.gitattributes
Safe
1.52 kB
initial commit
10 days ago
chat_template.jinja
Safe
199 Bytes
Fix: Environment issues resolved, full Fast Tokenizer uploaded.
5 days ago
special_tokens_map.json
Safe
95 Bytes
Upload folder using huggingface_hub
7 days ago
spm.model
1.03 MB
xet
Upload folder using huggingface_hub
10 days ago
spm.vocab
Safe
727 kB
Upload folder using huggingface_hub
10 days ago
tokenizer.json
2.63 MB
Final Rescue: Full vocab restoration with Unigram scores and Metaspace fix
5 days ago
tokenizer_config.json
Safe
261 Bytes
Final Fix: Correct Metaspace mapping and Unigram scores
5 days ago