JamesQuartz/QT_V.3_32K_UltraLingo
71 languages
Tags: tokenizers, tokenizer, multilingual, superbpe, bpe, byte-level, quartz, aenea, ultralingo
arXiv: 2503.13423, 2305.14201
License: apache-2.0
Branch: main
QT_V.3_32K_UltraLingo (5.44 MB)
1 contributor
History: 7 commits
Latest commit: JamesQuartz, "Upload README.md with huggingface_hub" (b702ea3, verified), 19 days ago
File | Size | Last commit message | Updated
.gitattributes | 1.52 kB | initial commit | 19 days ago
README.md | 8.3 kB | Upload README.md with huggingface_hub | 19 days ago
merges.txt | 356 kB | Upload merges.txt with huggingface_hub | 19 days ago
tokenizer.json | 2.4 MB | Upload tokenizer.json with huggingface_hub | 19 days ago
tokenizer_stage1.json | 2.1 MB | Upload tokenizer_stage1.json with huggingface_hub | 19 days ago
training_report.json | 5.86 kB | Upload training_report.json with huggingface_hub | 19 days ago
vocab.json | 571 kB | Upload vocab.json with huggingface_hub | 19 days ago