Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

anthonym21
/
json-tokenizer-structured

Text Generation
Transformers
English
tokenizer
json
bpe
structured-data
llm
Model card Files Files and versions
xet
Community
json-tokenizer-structured
594 kB
  • 1 contributor
History: 8 commits
anthonym21's picture
anthonym21
Update tokenizer: 73K training objects, 125 keys, DOI 10.5281/zenodo.18879110
f876778 verified 1 day ago
  • json_tokenizer
    Upload json_tokenizer/__init__.py with huggingface_hub 1 day ago
  • native_format
    Upload folder using huggingface_hub 1 day ago
  • .gitattributes
    1.52 kB
    initial commit 1 day ago
  • README.md
    4.1 kB
    Update tokenizer: 73K training objects, 125 keys, DOI 10.5281/zenodo.18879110 1 day ago
  • json_tokenizer_vocab.json
    289 kB
    Update tokenizer: 73K training objects, 125 keys, DOI 10.5281/zenodo.18879110 1 day ago
  • tokenizer_config.json
    963 Bytes
    Upload folder using huggingface_hub 1 day ago