---
language:
- uk
tags:
- ukrainian
widget:
- text: "Тарас Шевченко – великий український <mask>."
license: mit
---
This is a smaller version of the [XLM-RoBERTa](https://huggingface.co/xlm-roberta-base) model that keeps only the Ukrainian embeddings and some of the English ones.
* The original model has 470M parameters, 384M of which are input and output embeddings.
* Shrinking the `sentencepiece` vocabulary from 250K to 31K tokens (the top 25K Ukrainian tokens plus the top English tokens) reduces the model to 134M parameters and its size from 1GB to 400MB.
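
The parameter counts above can be roughly reproduced with a little arithmetic. The sketch below is only an illustration: it assumes a hidden size of 768 (the value in the `xlm-roberta-base` configuration), uses the rounded vocabulary sizes quoted above, and counts the input and output embedding matrices separately, which is how the 384M figure appears to be derived. The helper name `embedding_params` is hypothetical.

```python
# Assumption: hidden size of 768, as in the xlm-roberta-base configuration.
HIDDEN_SIZE = 768


def embedding_params(vocab_size: int, hidden_size: int = HIDDEN_SIZE) -> int:
    """Parameters in the input and output embedding matrices, counted separately."""
    return 2 * vocab_size * hidden_size


# Figures quoted in this README (rounded).
ORIGINAL_TOTAL = 470_000_000
ORIGINAL_VOCAB = 250_000
SHRUNK_VOCAB = 31_000

original_emb = embedding_params(ORIGINAL_VOCAB)
# Everything that is not an embedding is unaffected by the vocabulary change.
non_embedding = ORIGINAL_TOTAL - original_emb
shrunk_emb = embedding_params(SHRUNK_VOCAB)
shrunk_total = non_embedding + shrunk_emb

print(f"original embeddings: {original_emb / 1e6:.0f}M")
print(f"non-embedding parameters: {non_embedding / 1e6:.0f}M")
print(f"shrunk embeddings: {shrunk_emb / 1e6:.1f}M")
print(f"shrunk total: {shrunk_total / 1e6:.0f}M")
```

Under these assumptions the estimate lands at about 134M parameters, in line with the figure stated above.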