Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
latincy
/
latin-bert
like
0
Follow
LatinCy
22
Fill-Mask
Transformers
PyTorch
Safetensors
Latin
bert
feature-extraction
latin
nlp
classics
arxiv:
2009.10053
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
refs/pr/1
latin-bert
/
src
/
latincy_latinbert
11.7 kB
Ctrl+K
Ctrl+K
3 contributors
History:
2 commits
diyclassics
Fix tokenizer ID offset: reserve IDs 0-4 for BERT special tokens
ce59834
about 2 months ago
__init__.py
Safe
177 Bytes
Initial: HF-compatible Latin BERT tokenizer (Bamman & Burns 2020)
about 2 months ago
tokenization_latin_bert.py
Safe
11.2 kB
Fix tokenizer ID offset: reserve IDs 0-4 for BERT special tokens
about 2 months ago
tokenizer_config.json
Safe
315 Bytes
Fix tokenizer ID offset: reserve IDs 0-4 for BERT special tokens
about 2 months ago