Update readme
Browse files
README.md
CHANGED
@@ -20,7 +20,7 @@ widget:
|
|
20 |
# Code-Mixed Language Detection using XLM-RoBERTa
|
21 |
|
22 |
## Description
|
23 |
-
This model detects
|
24 |
|
25 |
## Training Dataset
|
26 |
The training dataset is based on [The Multilingual Amazon Reviews Corpus](https://huggingface.co/datasets/amazon_reviews_multi). The preprocessed dataset can be found [here](https://huggingface.co/datasets/msislam/marc-code-mixed-small).
|
|
|
20 |
# Code-Mixed Language Detection using XLM-RoBERTa
|
21 |
|
22 |
## Description
|
23 |
+
This model detects languages in a text (Code-Mixed text) with their boundaries by classifying each token. Currently, it supports German (DE), English (EN), Spanish (ES), and French (FR) languages. The model is fine-tuned on [xlm-roberta-base](https://huggingface.co/xlm-roberta-base).
|
24 |
|
25 |
## Training Dataset
|
26 |
The training dataset is based on [The Multilingual Amazon Reviews Corpus](https://huggingface.co/datasets/amazon_reviews_multi). The preprocessed dataset can be found [here](https://huggingface.co/datasets/msislam/marc-code-mixed-small).
|