Kashif786/gemma-7b-it-sindhi-tokenizer

Transformers
Sindhi
English
sindhi
nlp
tokenizer-extension
gemma
low-resource-languages
unigram
gemma-7b-it-sindhi-tokenizer
37.8 MB
  • 1 contributor
History: 3 commits
Latest commit: Update README.md (f7b4b6b, verified, 14 days ago) by Kashif786
  • .gitattributes (1.57 kB): Add 20k Sindhi unigram tokens to Gemma-7B-it base for thesis research, 14 days ago
  • README.md (2.48 kB): Update README.md, 14 days ago
  • chat_template.jinja (591 Bytes): Add 20k Sindhi unigram tokens to Gemma-7B-it base for thesis research, 14 days ago
  • tokenizer.json (37.8 MB, xet): Add 20k Sindhi unigram tokens to Gemma-7B-it base for thesis research, 14 days ago
  • tokenizer_config.json (489 Bytes): Add 20k Sindhi unigram tokens to Gemma-7B-it base for thesis research, 14 days ago