Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

latincy
/
latin-bert

Fill-Mask
Transformers
PyTorch
Safetensors
Latin
bert
feature-extraction
latin
nlp
classics
Model card Files Files and versions
xet
Community
1
latin-bert / src /latincy_latinbert
11.7 kB
Ctrl+K
Ctrl+K
  • 3 contributors
History: 2 commits
diyclassics's picture
diyclassics
Fix tokenizer ID offset: reserve IDs 0-4 for BERT special tokens
ce59834 about 2 months ago
  • __init__.py
    177 Bytes
    Initial: HF-compatible Latin BERT tokenizer (Bamman & Burns 2020) about 2 months ago
  • tokenization_latin_bert.py
    11.2 kB
    Fix tokenizer ID offset: reserve IDs 0-4 for BERT special tokens about 2 months ago
  • tokenizer_config.json
    315 Bytes
    Fix tokenizer ID offset: reserve IDs 0-4 for BERT special tokens about 2 months ago