ibraheemmoosa
commited on
Commit
·
2725cd9
1
Parent(s):
0aece10
Update README.md
Browse files
README.md
CHANGED
@@ -42,7 +42,7 @@ co2_eq_emissions:
|
|
42 |
|
43 |
# XLMIndic Base Uniscript
|
44 |
|
45 |
-
|
46 |
library. A demo of Aksharamukha library is hosted [here](https://aksharamukha.appspot.com/converter)
|
47 |
where you can transliterate your text and use it on our model on the inference widget.
|
48 |
|
|
|
42 |
|
43 |
# XLMIndic Base Uniscript
|
44 |
|
45 |
+
This model is pretrained on a subset of the [OSCAR](https://huggingface.co/datasets/oscar) corpus spanning 14 Indo-Aryan languages. Before pretraining this model we transliterate the text to [ISO-15919](https://en.wikipedia.org/wiki/ISO_15919) format using the [Aksharamukha](https://pypi.org/project/aksharamukha/)
|
46 |
library. A demo of Aksharamukha library is hosted [here](https://aksharamukha.appspot.com/converter)
|
47 |
where you can transliterate your text and use it on our model on the inference widget.
|
48 |
|