Andrey Kutuzov
commited on
Commit
·
acc64ce
1
Parent(s):
4a0ae9c
Discarded basic tokenization to better fit our vocabulary
Browse files- tokenizer_config.json +2 -1
tokenizer_config.json
CHANGED
|
@@ -1,3 +1,4 @@
|
|
| 1 |
{
|
| 2 |
-
"do_lower_case": false
|
|
|
|
| 3 |
}
|
|
|
|
| 1 |
{
|
| 2 |
+
"do_lower_case": false,
|
| 3 |
+
"do_basic_tokenize": false
|
| 4 |
}
|