Using Tokenizer

#6
by pushpdeep - opened

I ran into a problem while using:

tokenizer = AutoTokenizer.from_pretrained('ai4bharat/indic-bert')

I was working in Colab. Upon searching for a solution, I found that sentencepiece needs to be installed. I uninstalled transformers and reinstalled it along with sentencepiece:

!pip install --no-cache-dir transformers sentencepiece

and then loaded the tokenizer using:

tokenizer = AutoTokenizer.from_pretrained('ai4bharat/indic-bert', use_fast=False)
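If you want to confirm that sentencepiece is actually available in your environment before loading the tokenizer (the slow tokenizer for this model depends on it), a quick check like the following sketch works; the helper name `has_sentencepiece` is just illustrative:

```python
import importlib.util

def has_sentencepiece():
    # Returns True if the sentencepiece package is importable,
    # without actually importing it.
    return importlib.util.find_spec("sentencepiece") is not None

if not has_sentencepiece():
    print("sentencepiece is missing; run: pip install sentencepiece")
```

If the check fails, install sentencepiece and restart the Colab runtime before calling `AutoTokenizer.from_pretrained` again, since an already-running kernel won't pick up the new package.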

Hope this helps!

pushpdeep changed discussion status to closed
