gpt2-small-catalan-v2 / tokenizer_config.json
ClassCat's picture
add tokenizer
3fac4f7
raw
history blame contribute delete
325 Bytes
{"unk_token": "<|endoftext|>", "bos_token": "<|endoftext|>", "eos_token": "<|endoftext|>", "add_prefix_space": false, "keep_accents": true, "max_len": 50, "special_tokens_map_file": "classcat/gpt2-small-catalan-v2/special_tokens_map.json", "name_or_path": "classcat/gpt2-small-catalan-v2", "tokenizer_class": "GPT2Tokenizer"}