Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

wikilangs
/
btm

Text Generation
fastText
Batak Mandailing
wikilangs
nlp
tokenizer
embeddings
n-gram
markov
wikipedia
feature-extraction
sentence-similarity
tokenization
n-grams
markov-chain
text-mining
babelvec
vocabulous
vocabulary
monolingual
family-austronesian_batak
Model card Files Files and versions
xet
Community
btm / models /tokenizer
4.83 MB
  • 1 contributor
History: 1 commit
omarkamali's picture
omarkamali
Upload all models and assets for btm (20251201)
9ab2d8e verified 20 days ago
  • btm_tokenizer_16k.model
    505 kB
    xet
    Upload all models and assets for btm (20251201) 20 days ago
  • btm_tokenizer_16k.vocab
    231 kB
    Upload all models and assets for btm (20251201) 20 days ago
  • btm_tokenizer_32k.model
    789 kB
    xet
    Upload all models and assets for btm (20251201) 20 days ago
  • btm_tokenizer_32k.vocab
    499 kB
    Upload all models and assets for btm (20251201) 20 days ago
  • btm_tokenizer_64k.model
    1.32 MB
    xet
    Upload all models and assets for btm (20251201) 20 days ago
  • btm_tokenizer_64k.vocab
    1 MB
    Upload all models and assets for btm (20251201) 20 days ago
  • btm_tokenizer_8k.model
    372 kB
    xet
    Upload all models and assets for btm (20251201) 20 days ago
  • btm_tokenizer_8k.vocab
    108 kB
    Upload all models and assets for btm (20251201) 20 days ago