hafeez007/balochi-tokenizers

Tags: Text Generation, Baluchi, English, sentencepiece, tokenizer, wordpiece, bpe, balochi, southern-balochi, low-resource-nlp, perso-arabic, nlp, gemma, bert, roberta
balochi-tokenizers
76.6 MB
  • 1 contributor
History: 2 commits
hafeez007
Update tokenizer models and README
e899795 verified 27 days ago
  • Ablation
    Update tokenizer models and README 27 days ago
  • Code
    Update tokenizer models and README 27 days ago
  • Models
    Update tokenizer models and README 27 days ago
  • Tokens
    Update tokenizer models and README 27 days ago
  • .gitattributes
    1.52 kB
    initial commit 27 days ago
  • Balochi_BPE_Tokenizer_64000.json
    5.03 MB
    Update tokenizer models and README 27 days ago
  • Balochi_Sentence_Piece_Tokenizer_64000.model
    1.53 MB
    xet
    Update tokenizer models and README 27 days ago
  • Balochi_Sentence_Piece_Tokenizer_Vocab_64000.vocab
    1.28 MB
    Update tokenizer models and README 27 days ago
  • Balochi_Word_Piece_Tokenizer_64000.json
    1.72 MB
    Update tokenizer models and README 27 days ago
  • Balochi_bpe_47000_tokenizer.json
    3.43 MB
    Update tokenizer models and README 27 days ago
  • Balochi_bpe_tokenizer_80000.json
    6.35 MB
    Update tokenizer models and README 27 days ago
  • Balochi_sentencepiece_47000_tokenizer.model
    1.19 MB
    xet
    Update tokenizer models and README 27 days ago
  • Balochi_sentencepiece_47000_tokenizer.vocab
    1.03 MB
    Update tokenizer models and README 27 days ago
  • Balochi_wordpiece_47000_tokenizer.json
    1.2 MB
    Update tokenizer models and README 27 days ago
  • README.md
    13 kB
    Update tokenizer models and README 27 days ago