trixyL commited on
Commit
08fb3c4
·
verified ·
1 Parent(s): aeb4ceb

add: tokenizer config

Browse files
Files changed (3) hide show
  1. merges.txt +0 -0
  2. special_tokens.json +5 -0
  3. vocab.json +0 -0
merges.txt ADDED
The diff for this file is too large to render. See raw diff
 
special_tokens.json ADDED
@@ -0,0 +1,5 @@
 
 
 
 
 
 
1
+ {
2
+ "<|startoftext|>": 50256,
3
+ "<|endoftext|>": 50257,
4
+ "<|mdm_mask|>": 50258,
5
+ }
vocab.json ADDED
The diff for this file is too large to render. See raw diff