Andrey Kutuzov
commited on
Commit
·
acc64ce
1
Parent(s):
4a0ae9c
Discarded basic tokenization to better fit our vocabulary
Browse files- tokenizer_config.json +2 -1
tokenizer_config.json
CHANGED
@@ -1,3 +1,4 @@
|
|
1 |
{
|
2 |
-
"do_lower_case": false
|
|
|
3 |
}
|
|
|
1 |
{
|
2 |
+
"do_lower_case": false,
|
3 |
+
"do_basic_tokenize": false
|
4 |
}
|