Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

versae
/
scandinavian-tokenizer

Model card Files Files and versions
xet
Community
scandinavian-tokenizer / texts
33.1 GB
  • 1 contributor
History: 1 commit
versae's picture
versae
Scandi+English tokenizer on OSCAR
e5b62ac almost 2 years ago
  • all.txt
    16.6 GB
    xet
    Scandi+English tokenizer on OSCAR almost 2 years ago
  • da.opening.txt
    3.26 GB
    xet
    Scandi+English tokenizer on OSCAR almost 2 years ago
  • en.opening.txt
    6.96 GB
    xet
    Scandi+English tokenizer on OSCAR almost 2 years ago
  • nn.opening.txt
    87.1 MB
    xet
    Scandi+English tokenizer on OSCAR almost 2 years ago
  • nn.opening.wiki.txt
    113 MB
    xet
    Scandi+English tokenizer on OSCAR almost 2 years ago
  • no.opening.txt
    2.4 GB
    xet
    Scandi+English tokenizer on OSCAR almost 2 years ago
  • sv.opening.txt
    3.73 GB
    xet
    Scandi+English tokenizer on OSCAR almost 2 years ago