tokenizer-128k / added_tokens.json
Upload 128k vocab SentencePiece BPE tokenizer (trained on 175GB)
43f50d5 verified
{
"<pad>": 131072
}
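
The mapping above assigns `<pad>` the id 131072. If the base SentencePiece vocabulary holds exactly 128 × 1024 = 131072 pieces (ids 0 through 131071), which is an assumption drawn from the "128k vocab" description and not something this file confirms, then 131072 is the first free id after the base vocabulary. The sketch below checks that property; `ASSUMED_BASE_VOCAB_SIZE` and `check_added_tokens` are hypothetical names used only for illustration.

```python
import json

# Contents of added_tokens.json, inlined so the example is self-contained.
ADDED_TOKENS_JSON = '{"<pad>": 131072}'

# Assumption: "128k" means exactly 128 * 1024 base pieces (ids 0..131071).
ASSUMED_BASE_VOCAB_SIZE = 128 * 1024


def check_added_tokens(added, base_vocab_size):
    """Return a list of problems; an empty list means the ids look consistent."""
    problems = []
    ids = sorted(added.values())
    # Added tokens should occupy the ids immediately after the base vocabulary,
    # with no gaps and no overlap with existing pieces.
    expected = list(range(base_vocab_size, base_vocab_size + len(ids)))
    if ids != expected:
        problems.append(f"expected ids {expected}, found {ids}")
    return problems


added_tokens = json.loads(ADDED_TOKENS_JSON)
problems = check_added_tokens(added_tokens, ASSUMED_BASE_VOCAB_SIZE)
print(problems if problems else "added token ids are contiguous with the base vocab")
```

If the check holds, a model using this tokenizer would need an embedding table with at least 131073 rows to cover the `<pad>` id.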