Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

wikilangs
/
bbc

Feature Extraction
Batak Toba
wikilangs
nlp
tokenizer
embeddings
n-gram
markov
wikipedia
monolingual
family-austronesian_batak
Model card Files Files and versions
xet
Community
bbc / models /tokenizer
5.27 MB
  • 1 contributor
History: 1 commit
omarkamali's picture
omarkamali
Upload all models and assets for bbc (20251201)
1e6f4ab verified about 20 hours ago
  • bbc_tokenizer_16k.model
    526 kB
    xet
    Upload all models and assets for bbc (20251201) about 20 hours ago
  • bbc_tokenizer_16k.vocab
    252 kB
    Upload all models and assets for bbc (20251201) about 20 hours ago
  • bbc_tokenizer_32k.model
    829 kB
    xet
    Upload all models and assets for bbc (20251201) about 20 hours ago
  • bbc_tokenizer_32k.vocab
    539 kB
    Upload all models and assets for bbc (20251201) about 20 hours ago
  • bbc_tokenizer_64k.model
    1.48 MB
    xet
    Upload all models and assets for bbc (20251201) about 20 hours ago
  • bbc_tokenizer_64k.vocab
    1.15 MB
    Upload all models and assets for bbc (20251201) about 20 hours ago
  • bbc_tokenizer_8k.model
    380 kB
    xet
    Upload all models and assets for bbc (20251201) about 20 hours ago
  • bbc_tokenizer_8k.vocab
    116 kB
    Upload all models and assets for bbc (20251201) about 20 hours ago