msislam
/

code-mixed-language-detection-XLMRoberta

Token Classification

Inference Endpoints

Model card Files Files and versions Community

msislam commited on Jul 2, 2023

Commit

251d5fe

•

1 Parent(s): 3b2c451

Update readme

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ widget:
 # Code-Mixed Language Detection using XLM-RoBERTa
 ## Description
-This model detects Languages with its boundary by classifying each token. Currently, it supports German (DE), English (EN), Spanish (ES), and French (FR) languages. The model is fine-tuned on [xlm-roberta-base](https://huggingface.co/xlm-roberta-base).
 ## Training Dataset
 The training dataset is based on [The Multilingual Amazon Reviews Corpus](https://huggingface.co/datasets/amazon_reviews_multi). The preprocessed dataset can be found [here](https://huggingface.co/datasets/msislam/marc-code-mixed-small).

 # Code-Mixed Language Detection using XLM-RoBERTa
 ## Description
+This model detects languages in a text (Code-Mixed text) with their boundaries by classifying each token. Currently, it supports German (DE), English (EN), Spanish (ES), and French (FR) languages. The model is fine-tuned on [xlm-roberta-base](https://huggingface.co/xlm-roberta-base).
 ## Training Dataset
 The training dataset is based on [The Multilingual Amazon Reviews Corpus](https://huggingface.co/datasets/amazon_reviews_multi). The preprocessed dataset can be found [here](https://huggingface.co/datasets/msislam/marc-code-mixed-small).