oscar_13_languages_equal_weight / tokenizer_config.json
Teven Le Scao
add tokenizer
a20f06a
153 Bytes
{"unk_token": "<|endoftext|>", "bos_token": "<|endoftext|>", "eos_token": "<|endoftext|>", "add_prefix_space": false, "tokenizer_class": "GPT2Tokenizer"}
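
The config above maps three special-token roles to the same string. As a minimal sketch, using only the Python standard library, the snippet below parses the JSON and groups the `*_token` keys by their value. The helper name `summarize_tokenizer_config` and the `CONFIG_TEXT` constant are illustrative names, not part of the repository.

```python
import json

# Contents of tokenizer_config.json, copied verbatim from the file above.
CONFIG_TEXT = (
    '{"unk_token": "<|endoftext|>", "bos_token": "<|endoftext|>", '
    '"eos_token": "<|endoftext|>", "add_prefix_space": false, '
    '"tokenizer_class": "GPT2Tokenizer"}'
)


def summarize_tokenizer_config(text):
    """Parse the config and group special-token roles by their token string.

    Hypothetical helper for illustration; it is not an API of any library.
    """
    config = json.loads(text)
    roles = {}
    for key, value in config.items():
        # Only keys such as unk_token / bos_token / eos_token name a token role.
        if key.endswith("_token"):
            roles.setdefault(value, []).append(key)
    return config, roles


config, roles = summarize_tokenizer_config(CONFIG_TEXT)
print(config["tokenizer_class"])
print(roles)
```

Grouping by value makes it easy to see that the unknown, beginning-of-sequence, and end-of-sequence roles all resolve to one token here, which is the usual arrangement for GPT-2 style tokenizers.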