deeplm-108m / tokenizer_config.json
Upload tokenizer_config.json with huggingface_hub
7cff78c verified
254 Bytes
{
  "vocab_size": 32000,
  "model_max_length": 2048,
  "tokenizer_class": "BPETokenizer",
  "special_tokens": {
    "pad_token": "<|pad|>",
    "unk_token": "<|unk|>",
    "bos_token": "<|begin_of_sentence|>",
    "eos_token": "<|end_of_sentence|>"
  }
}
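
The config above could be read and sanity-checked with a short script like the one below. This is only a sketch using Python's standard `json` module: the helper name `load_tokenizer_config`, the `REQUIRED_SPECIAL_TOKENS` tuple, and the specific checks are assumptions made for illustration, not part of the repository, and nothing here implies how the `BPETokenizer` class named in the config actually consumes these fields.

```python
import json

# Config text copied from the file above; in practice it would be read from disk.
CONFIG_TEXT = """
{
  "vocab_size": 32000,
  "model_max_length": 2048,
  "tokenizer_class": "BPETokenizer",
  "special_tokens": {
    "pad_token": "<|pad|>",
    "unk_token": "<|unk|>",
    "bos_token": "<|begin_of_sentence|>",
    "eos_token": "<|end_of_sentence|>"
  }
}
"""

# Assumption: all four special tokens listed in the file are treated as required.
REQUIRED_SPECIAL_TOKENS = ("pad_token", "unk_token", "bos_token", "eos_token")


def load_tokenizer_config(text: str) -> dict:
    """Parse the config and run basic sanity checks (hypothetical helper)."""
    config = json.loads(text)
    special = config["special_tokens"]

    # Every expected special token must be present.
    missing = [name for name in REQUIRED_SPECIAL_TOKENS if name not in special]
    if missing:
        raise ValueError(f"missing special tokens: {missing}")

    # Two roles sharing one string would be ambiguous in this sketch.
    if len(set(special.values())) != len(special):
        raise ValueError("special tokens must be distinct strings")

    # The vocabulary must at least have room for the special tokens.
    if config["vocab_size"] < len(special):
        raise ValueError("vocab_size is too small to hold the special tokens")

    return config


config = load_tokenizer_config(CONFIG_TEXT)
print(config["tokenizer_class"], config["vocab_size"], config["model_max_length"])
```

Checking the file at load time, rather than when a token is first needed, surfaces a missing or duplicated special token before any text is encoded.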