Oriya
sentencepiece
OdiaTokenizer / tokenizer_config.json
shantipriya's picture
Create tokenizer_config.json
ca15fcc verified
raw
history blame contribute delete
245 Bytes
{
"model_max_length": 512,
"do_lower_case": false,
"unk_token": "<unk>",
"bos_token": "<s>",
"eos_token": "</s>",
"pad_token": "<pad>",
"tokenizer_class": "PreTrainedTokenizerFast",
"sp_model_file": "odia_tokenizers_test.model"
}