---
datasets:
- oscar-corpus/OSCAR-2201
- mc4
language:
- he
---
## Hebrew Language Model

State-of-the-art RoBERTa language model for Hebrew.

#### How to use
```python
from transformers import AutoModelForMaskedLM, AutoTokenizer

# Load the tokenizer and the masked-language-model weights from the Hugging Face Hub
tokenizer = AutoTokenizer.from_pretrained('HeNLP/HeRo')
model = AutoModelForMaskedLM.from_pretrained('HeNLP/HeRo')
```
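
Once the model loads, a quick sanity check is to ask it to fill in a masked word. The sketch below is one possible way to do that with the `fill-mask` pipeline from `transformers`. The helper names (`insert_mask`, `summarize`, `fill_mask`) and the `___` slot marker are illustrative assumptions, not part of this model card. The mask token is read from the tokenizer rather than hard-coded.

```python
from typing import Dict, List, Tuple


def insert_mask(template: str, mask_token: str, slot: str = "___") -> str:
    """Replace the first `slot` marker in `template` with the model's mask token."""
    if slot not in template:
        raise ValueError(f"template must contain the slot marker {slot!r}")
    return template.replace(slot, mask_token, 1)


def summarize(predictions: List[Dict], top_k: int = 3) -> List[Tuple[str, float]]:
    """Keep the `top_k` highest-scoring (token, score) pairs from fill-mask output."""
    ranked = sorted(predictions, key=lambda p: p["score"], reverse=True)
    return [(p["token_str"].strip(), round(p["score"], 4)) for p in ranked[:top_k]]


def fill_mask(template: str, model_name: str = "HeNLP/HeRo", top_k: int = 3):
    """Download the model (needs network access) and predict the masked word."""
    # Imported lazily so the two helpers above stay usable without transformers.
    from transformers import pipeline

    unmasker = pipeline("fill-mask", model=model_name)
    # Ask the tokenizer for its mask token instead of assuming a literal string.
    text = insert_mask(template, unmasker.tokenizer.mask_token)
    return summarize(unmasker(text, top_k=top_k), top_k)
```

For example, `fill_mask("אני אוהב לשתות ___ בבוקר")` ("I like to drink ___ in the morning") should return up to three candidate tokens with their scores. The exact predictions depend on the model weights and are not shown here.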