openllm-small-extended-7k / tokenizer_config.json
lemms's picture
Add OpenLLM Small Extended 7k model
bc52965 verified
{
"tokenizer_class": "SentencePieceTokenizer",
"model_max_length": 1024,
"vocab_size": 32000,
"unk_token": "<unk>",
"bos_token": "<s>",
"eos_token": "</s>",
"pad_token": "<pad>"
}