--- license: apache-2.0 datasets: - risqaliyevds/uzbek_ner language: - uz metrics: - precision - f1 - recall - accuracy base_model: - FacebookAI/xlm-roberta-base pipeline_tag: token-classification --- # NER Model for Uzbek Language (XLM-RoBERTa-based) This is a Named Entity Recognition (NER) model trained for the Uzbek language based on the XLM-RoBERTa architecture. It is fine-tuned to classify entities into categories such as location, person, organization, and other types. ## Model Details - **Model Type**: XLM-RoBERTa (Transformer-based) - **Task**: Named Entity Recognition (NER) - **Training Data**: Custom dataset with labeled named entities for Uzbek language. - **Categories**: - `B-LOC` (Location) - `B-PERSON` (Person) - `B-ORG` (Organization) - `B-PRODUCT` (Product) - `B-DATE` (Date) - `B-TIME` - `B-LANGUAGE` - `B-GPE` ## Metrics - **Validation accuracy = 0.9793** - **val_loss: 0.1141** - **Precision: 0.97** - **Recall: 0.97** - **F1-Score: 0.97** ## Usage You can use this model with the Hugging Face Transformers library to perform NER tasks on your own Uzbek language text. ```python from transformers import pipeline # Load the NER model ner_pipeline = pipeline('ner', model='jamshidahmadov/roberta-ner-uz', tokenizer='jamshidahmadov/roberta-ner-uz') # Example usage text = "Shvetsiya bosh vaziri Stefan Lyoven Stokholmdagi Spendrups kompaniyasiga tashrif buyurdi." entities = ner_pipeline(text) for entity in entities: print(entity)