semantic-evm-mlm-chkp1000 / tokenizer_config.json
somehowchris's picture
Upload folder using huggingface_hub
d74b88f verified
raw
history blame contribute delete
263 Bytes
{
"tokenizer_class": "PreTrainedTokenizerFast",
"vocab_size": 32768,
"pad_token": "[PAD]",
"unk_token": "[UNK]",
"cls_token": "[CLS]",
"sep_token": "[SEP]",
"mask_token": "[MASK]",
"model_max_length": 512,
"clean_up_tokenization_spaces": false
}