tgsc
/

BPE-tokenizer-32k-CLS-SEP-post

BPE-tokenizer-32k-CLS-SEP-post / tokenizer_config.json

Upload tokenizer

86b7fa5 over 2 years ago

196 Bytes

	{
	"clean_up_tokenization_spaces": true,
	"model_max_length": 512,
	"special_tokens": [
	"[PAD]",
	"[SEP]",
	"[CLS]",
	"[UNK]"
	],
	"tokenizer_class": "PreTrainedTokenizerFast"
	}