import streamlit as st # Custom CSS for better styling st.markdown(""" """, unsafe_allow_html=True) # Main Title st.markdown('
This app utilizes the nerdl_fewnerd_subentity_100d model, which is trained on the Few-NERD/inter public dataset to detect 66 entities with high accuracy. The model is based on 100d GloVe embeddings, ensuring robust entity detection.
Entity Recognition is a task in Natural Language Processing (NLP) that involves identifying and classifying named entities in text into predefined categories. For general texts, this model focuses on detecting a wide range of entities, which are crucial for understanding and analyzing diverse content.
The nerdl_fewnerd_subentity_100d model is highly effective for extracting named entities from general texts. Its applications include:
Why use the nerdl_fewnerd_subentity_100d model?
building-theater
, art-other
, location-bodiesofwater
, other-god
, organization-politicalparty
, product-other
, building-sportsfacility
, building-restaurant
, organization-sportsleague
, event-election
, organization-media/newspaper
, product-software
, other-educationaldegree
, person-politician
, person-soldier
, other-disease
, product-airplane
, person-athlete
, location-mountain
, organization-company
, other-biologything
, location-other
, other-livingthing
, person-actor
, organization-other
, event-protest
, art-film
, other-award
, other-astronomything
, building-airport
, product-food
, person-other
, event-disaster
, product-weapon
, event-sportsevent
, location-park
, product-ship
, building-library
, art-painting
, building-other
, other-currency
, organization-education
, person-scholar
, organization-showorganization
, person-artist/author
, product-train
, location-GPE
, product-car
, art-writtenart
, event-attack/battle/war/militaryconflict
, other-law
, other-medical
, organization-sportsteam
, art-broadcastprogram
, art-music
, organization-government/governmentagency
, other-language
, event-other
, person-director
, other-chemicalthing
, product-game
, organization-religion
, location-road/railway/highway/transit
, location-island
, building-hotel
, building-hospital
Attribute | Description |
---|---|
Model Name | nerdl_fewnerd_subentity_100d |
Type | ner |
Compatibility | Spark NLP 3.1.1+ |
License | Open Source |
Edition | Official |
Input Labels | [sentence, token, embeddings] |
Output Labels | [ner] |
Language | en |
Attribute | Description |
---|---|
Dataset | Few-NERD: A Few-shot Named Entity Recognition Dataset |
Authors | Ding, Ning; Xu, Guangwei; Chen, Yulin; Wang, Xiaobin; Han, Xu; Xie, Pengjun; Zheng, Hai-Tao; Liu, Zhiyuan |
Conference | ACL-IJCNL 2021 |
Metric | Score |
---|---|
Precision | 89.45% |
Recall | 91.67% |
F1-Score | 90.55% |
The benchmarking results highlight the performance of the nerdl_fewnerd_subentity_100d model. The metrics used are:
The scores indicate that the model achieves high accuracy and reliability in detecting entities within general scope texts.
The nerdl_fewnerd_subentity_100d model is a powerful tool for entity recognition in general texts, offering high accuracy across a diverse set of entities. Its robust performance, as demonstrated by the benchmark results, makes it suitable for various applications such as text analysis, content classification, and knowledge graph construction. By utilizing this model, users can effectively extract and categorize entities, enhancing their ability to analyze and understand textual data.
For more information and to access the model, visit the John Snow Labs Model Page or the Spark NLP GitHub Repository.