Wojood - Nested/Flat Arabic NER Models

Wojood is a corpus for Arabic nested Named Entity Recognition (NER). Nested entities occur when one entity mention is embedded inside another entity mention. The corpus contains 550K tokens of Modern Standard Arabic (MSA) and dialect text. This repo contains the source code to train the Wojood nested NER model.
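To make the idea of nesting concrete, here is a minimal sketch in Python. It is not taken from the Wojood code base: the `Span` class, the helper functions, the English example tokens, and the labels are all hypothetical and chosen only for illustration. Keeping only the outermost mentions is one possible way to obtain a flat view, and is an assumption here rather than a description of how the flat model was built.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Span:
    """A hypothetical entity mention over token positions."""
    start: int   # index of the first token (inclusive)
    end: int     # index after the last token (exclusive)
    label: str   # illustrative label, not necessarily a Wojood tag


def is_nested_in(inner: Span, outer: Span) -> bool:
    """True if `inner` lies inside `outer` and covers a different token range."""
    inside = outer.start <= inner.start and inner.end <= outer.end
    same_range = (inner.start, inner.end) == (outer.start, outer.end)
    return inside and not same_range


def nested_pairs(spans):
    """All (inner, outer) pairs where one mention is embedded in another."""
    return [(i, o) for i in spans for o in spans if is_nested_in(i, o)]


def outermost_only(spans):
    """Assumed flattening: drop every mention embedded in another mention."""
    return [s for s in spans if not any(is_nested_in(s, o) for o in spans)]


# Illustrative sentence (English stand-in for an Arabic example).
tokens = ["Birzeit", "University", "in", "Palestine"]
spans = [
    Span(0, 2, "ORG"),  # "Birzeit University"
    Span(0, 1, "GPE"),  # "Birzeit", embedded in the ORG mention
    Span(3, 4, "GPE"),  # "Palestine"
]

for inner, outer in nested_pairs(spans):
    print(tokens[inner.start:inner.end], "is nested in", tokens[outer.start:outer.end])
```

A nested tagger is expected to return all three mentions, while a flat tagger can assign at most one label per token and so cannot represent both "Birzeit" and "Birzeit University" at once.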

Online Demo

You can try our model using the demo link below:

https://sina.birzeit.edu/wojood/

https://arxiv.org/abs/2205.09651

https://huggingface.co/aubmindlab/bert-base-arabertv2/tree/main

Models

  • Nested NER (main branch), with a micro-F1 score of 0.909551
  • Flat NER (flat branch), with a micro-F1 score of 0.883847
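For readers unfamiliar with the metric, the sketch below shows the general definition of micro-averaged F1: true positives, false positives, and false negatives are summed over all entity types before precision and recall are computed. The function name and the counts are hypothetical, and this README does not state the exact matching criterion used to obtain the scores above, so treat this as an illustration of the formula only.

```python
def micro_f1(counts):
    """Micro-averaged F1 from per-type counts.

    `counts` maps an entity type to a dict with keys "tp", "fp", "fn".
    Counts are pooled across types before precision/recall are computed.
    """
    tp = sum(c["tp"] for c in counts.values())
    fp = sum(c["fp"] for c in counts.values())
    fn = sum(c["fn"] for c in counts.values())

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts, not results from the Wojood models.
example_counts = {
    "PERS": {"tp": 8, "fp": 2, "fn": 2},
    "ORG": {"tp": 1, "fp": 1, "fn": 3},
}
print(round(micro_f1(example_counts), 4))
```

Because counts are pooled, frequent entity types weigh more heavily in the score than rare ones, which is the main difference from macro-averaging.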

Google Colab Notebooks

You can test our model using our Google Colab notebooks
