Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

itwk
/
mc4_6000_10M_nonwaka

Model card Files Files and versions
xet
Community
mc4_6000_10M_nonwaka
1.05 MB
  • 1 contributor
History: 2 commits
itwk's picture
itwk
Upload 7 files
ae88e64 verified over 1 year ago
  • .gitattributes
    1.52 kB
    initial commit over 1 year ago
  • special_tokens_map.json
    73 Bytes
    Upload 7 files over 1 year ago
  • spiece.model
    88.8 kB
    xet
    Upload 7 files over 1 year ago
  • spiece.vocab
    91.8 kB
    Upload 7 files over 1 year ago
  • token_fraction.csv
    493 kB
    Upload 7 files over 1 year ago
  • token_fraction_describe.txt
    768 Bytes
    Upload 7 files over 1 year ago
  • tokenizer.json
    368 kB
    Upload 7 files over 1 year ago
  • tokenizer_config.json
    925 Bytes
    Upload 7 files over 1 year ago