---
license: odc-by
language:
- ar
- cy
- de
- en
- es
- fr
- id
- it
- ru
- sw
---
This is a raw, pretrained multilingual language model supporting Arabic, Welsh, German, English, Spanish, French, Indonesian, Italian, Russian, and Swahili.
The model was pretrained from scratch and has not been fine-tuned; for most use cases it should be fine-tuned further before use.
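
As a starting point for fine-tuning, the following is a minimal sketch rather than an official usage recipe. It assumes the checkpoint is a causal language model that the `transformers` Auto classes can load, which this card does not confirm. The names `SUPPORTED_LANGUAGES`, `is_supported`, and `load_base_model` are hypothetical helpers, and `model_id` is a placeholder the caller must supply, since the repository name is not given here.

```python
# Language codes and names as listed in this model card.
SUPPORTED_LANGUAGES = {
    "ar": "Arabic",
    "cy": "Welsh",
    "de": "German",
    "en": "English",
    "es": "Spanish",
    "fr": "French",
    "id": "Indonesian",
    "it": "Italian",
    "ru": "Russian",
    "sw": "Swahili",
}


def is_supported(lang_code: str) -> bool:
    """Return True if the language code appears in the card's language list."""
    return lang_code.strip().lower() in SUPPORTED_LANGUAGES


def load_base_model(model_id: str):
    """Load tokenizer and weights as a starting point for fine-tuning.

    Assumption: the checkpoint is a causal LM that the transformers Auto
    classes can load. `model_id` is a placeholder supplied by the caller;
    this card does not state the repository name.
    """
    # Imported lazily so the helpers above work without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    return tokenizer, model
```

Checking `is_supported("cy")` before building a fine-tuning dataset is a cheap way to catch data in a language the model was not pretrained on.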
For more details:
[Multilingual Language Model Pretraining using Machine-translated Data](https://arxiv.org/abs/2502.13252)
**Contact**
Email: [[email protected]](mailto:[email protected])