File size: 1,851 Bytes
4b1b595 8465980 4b1b595 8465980 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 |
---
license: apache-2.0
datasets:
- ACCORD-NLP/CODE-ACCORD-Entities
language:
- en
---
# ACCORD-NLP
ACCORD-NLP is a Natural Language Processing (NLP) framework developed by the [ACCORD](https://accordproject.eu/) project to facilitate Automated Compliance Checking (ACC) within the Architecture, Engineering, and Construction (AEC) sector.
It consists of several pre-trained/fine-tuned machine learning models to perform the following information extraction tasks from regulatory text.
1. Entity Extraction/Classification (ner)
2. Relation Extraction/Classification (re)
**ner-roberta-large-lm** is a RoBERTa large model fine-tuned for sequence labelling/entity classification using [CODE-ACCORD entities](https://huggingface.co/datasets/ACCORD-NLP/CODE-ACCORD-Entities) dataset, following language modelling using a building regulatory text corpus.
## Installation
### From Source
```
git clone https://github.com/Accord-Project/accord-nlp.git
cd accord-nlp
pip install -r requirements.txt
```
### From pip
```
pip install accord-nlp
```
## Using Pre-trained Models
### Entity Extraction/Classification (ner)
```python
from accord_nlp.text_classification.ner.ner_model import NERModel
model = NERModel('roberta', 'ACCORD-NLP/ner-roberta-large')
predictions, raw_outputs = model.predict(['The gradient of the passageway should not exceed five per cent.'])
print(predictions)
```
### Relation Extraction/Classification (re)
```python
from accord_nlp.text_classification.relation_extraction.re_model import REModel
model = REModel('roberta', 'ACCORD-NLP/re-roberta-large')
predictions, raw_outputs = model.predict(['The <e1>gradient<\e1> of the passageway should not exceed <e2>five per cent</e2>.'])
print(predictions)
```
For more details, please refer to the [ACCORD-NLP](https://github.com/Accord-Project/accord-nlp) GitHub repository. |