Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

karakaka
/
statement-pydec-dataset-tokenized

Model card Files Files and versions
xet
Community
statement-pydec-dataset-tokenized
669 kB
  • 1 contributor
History: 12 commits
karakaka's picture
karakaka
Extracted tokenizer
cd47e55 verified 3 months ago
  • .gitattributes
    1.52 kB
    initial commit 3 months ago
  • merges.txt
    20 kB
    Extracted merges from tokenizer 3 months ago
  • special_tokens_map.json
    30 kB
    Extracted special tokens map 3 months ago
  • tokenizer.json
    383 kB
    Extracted tokenizer 3 months ago
  • tokenizer_config.json
    181 kB
    Trained tokenizer using karakaka/statement-pydec-dataset 3 months ago
  • vocab.json
    53.6 kB
    Extracted vocabulary from tokenizer 3 months ago