ne_bert_tiny / README.md
jangedoo's picture
Update README.md
2b8e7b7 verified
metadata
library_name: transformers
tags: []

This is a tokenizer with same settings as bert-base-uncased but sets strip_accents=False.

Has a vocab size of 30K and was trained on various corpus including Nepali wikipedia, new articles etc.