Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
versae
/
scandinavian-tokenizer
like
0
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
scandinavian-tokenizer
/
texts
33.1 GB
1 contributor
History:
1 commit
versae
Scandi+English tokenizer on OSCAR
e5b62ac
almost 2 years ago
all.txt
16.6 GB
xet
Scandi+English tokenizer on OSCAR
almost 2 years ago
da.opening.txt
3.26 GB
xet
Scandi+English tokenizer on OSCAR
almost 2 years ago
en.opening.txt
6.96 GB
xet
Scandi+English tokenizer on OSCAR
almost 2 years ago
nn.opening.txt
87.1 MB
xet
Scandi+English tokenizer on OSCAR
almost 2 years ago
nn.opening.wiki.txt
113 MB
xet
Scandi+English tokenizer on OSCAR
almost 2 years ago
no.opening.txt
2.4 GB
xet
Scandi+English tokenizer on OSCAR
almost 2 years ago
sv.opening.txt
3.73 GB
xet
Scandi+English tokenizer on OSCAR
almost 2 years ago