File size: 542 Bytes
497d967 2cbbd96 497d967 2cbbd96 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 |
---
license: odc-by
language:
- ar
- cy
- de
- en
- es
- fr
- id
- it
- ru
- sw
---
This is a raw, pretrained multilingual language model, supporting Arabic, Welsh, German, English, Spanish, French, Indonesian, Italian, Russian, and Swahili.
The model is pretrained from scratch, which should be further finetuned for most use cases.
For more details:
[Multilingual Language Model Pretraining using Machine-translated Data](https://arxiv.org/abs/2502.13252)
**Contact**
Email: [[email protected]](mailto:[email protected]) |