Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

JamesQuartz
/
qt-V.4.6-32k

tokenizers
tokenizer
multilingual
bpe
superbpe
byte-level
quartz
prelude
Model card Files Files and versions
xet
Community
qt-V.4.6-32k
5.83 MB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 12 commits
JamesQuartz's picture
JamesQuartz
Add flores_qt_v.4.6_release.json (QuartzTokenizer V.4.6 32K Prelude)
f080369 verified 4 days ago
  • .gitattributes
    1.52 kB
    initial commit 4 days ago
  • BENCHMARK.md
    4.03 kB
    Add BENCHMARK.md (QuartzTokenizer V.4.6 32K Prelude) 4 days ago
  • CHANGELOG.md
    2.67 kB
    Add CHANGELOG.md (QuartzTokenizer V.4.6 32K Prelude) 4 days ago
  • README.md
    7.36 kB
    Add README.md (QuartzTokenizer V.4.6 32K Prelude) 4 days ago
  • flores_qt_v.4.6_release.json
    139 kB
    Add flores_qt_v.4.6_release.json (QuartzTokenizer V.4.6 32K Prelude) 4 days ago
  • merges.txt
    416 kB
    Add merges.txt (QuartzTokenizer V.4.6 32K Prelude) 4 days ago
  • qt_v.4.6_per_script_bpt.md
    1.69 kB
    Add qt_v.4.6_per_script_bpt.md (QuartzTokenizer V.4.6 32K Prelude) 4 days ago
  • shard_log.json
    204 Bytes
    Add shard_log.json (QuartzTokenizer V.4.6 32K Prelude) 4 days ago
  • tokenizer.json
    2.52 MB
    Add tokenizer.json (QuartzTokenizer V.4.6 32K Prelude) 4 days ago
  • tokenizer_stage1.json
    2.1 MB
    Add tokenizer_stage1.json (QuartzTokenizer V.4.6 32K Prelude) 4 days ago
  • training_report.json
    9.67 kB
    Add training_report.json (QuartzTokenizer V.4.6 32K Prelude) 4 days ago
  • vocab.json
    631 kB
    Add vocab.json (QuartzTokenizer V.4.6 32K Prelude) 4 days ago