OdinNext-138M-Base / tokenizer_config.json
joelhenwang's picture
OdinNext-138M-Base: EMA weights (101.6B-token dolmino base)
0ef192a verified
{
"tokenizer_class": "PreTrainedTokenizerFast",
"model_max_length": 2048,
"bos_token": "<|endoftext|>",
"eos_token": "<|endoftext|>",
"pad_token": "<|pad|>",
"clean_up_tokenization_spaces": false
}