KeyLM-75M / tokenizer_config.json
Commit 8dbce4f by Eclipse-Senpai: Add KeyLM-75M base model (bf16, from-scratch, ~18B tokens)
286 Bytes
{
  "bos_token": "<s>",
  "eos_token": "</s>",
  "lowercase": false,
  "model_max_length": 2048,
  "tokenizer_class": "PreTrainedTokenizerFast",
  "unk_token": "[UNK]",
  "vocab_size": 12020,
  "add_bos_token": false,
  "add_eos_token": false,
  "clean_up_tokenization_spaces": false
}
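
A minimal sketch of how the fields above might be read and sanity-checked, using only Python's standard `json` module. The config text is copied from the file shown here so that the snippet is self-contained. The helper name `summarize_tokenizer_config` and the shape of the summary it returns are hypothetical and chosen for illustration; they are not part of KeyLM-75M or of any library. Whether `add_bos_token` and `add_eos_token` take effect at encode time depends on the tokenizer implementation and is not checked here.

```python
import json

# Copy of the config shown above, embedded so the sketch runs on its own.
CONFIG_TEXT = """
{
  "bos_token": "<s>",
  "eos_token": "</s>",
  "lowercase": false,
  "model_max_length": 2048,
  "tokenizer_class": "PreTrainedTokenizerFast",
  "unk_token": "[UNK]",
  "vocab_size": 12020,
  "add_bos_token": false,
  "add_eos_token": false,
  "clean_up_tokenization_spaces": false
}
"""


def summarize_tokenizer_config(text):
    """Hypothetical helper: parse the JSON and collect the main settings."""
    cfg = json.loads(text)
    # Gather whichever special tokens the file declares.
    special = {
        key: cfg[key]
        for key in ("bos_token", "eos_token", "unk_token")
        if key in cfg
    }
    return {
        "special_tokens": special,
        "max_length": cfg.get("model_max_length"),
        "vocab_size": cfg.get("vocab_size"),
        # Missing flags are treated as False here; that default is an assumption.
        "adds_bos": bool(cfg.get("add_bos_token", False)),
        "adds_eos": bool(cfg.get("add_eos_token", False)),
    }


summary = summarize_tokenizer_config(CONFIG_TEXT)
print(summary)
```

With both `add_bos_token` and `add_eos_token` set to false, the summary reports that neither `<s>` nor `</s>` is requested automatically, so a caller who wants them would presumably need to add them explicitly.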