|
--- |
|
license: other |
|
language: |
|
- en |
|
pipeline_tag: text-classification |
|
tags: |
|
- legal |
|
- privacy |
|
- intent |
|
- privacy policies |
|
--- |
|
|
|
# privacy_intent for privacy policy intent classification |
|
|
|
This model is fine-tuned version of [mukund/privbert](https://huggingface.co/mukund/privbert) model on [PolicyIE dataset ](https://github.com/wasiahmad/PolicyIE/blob/main/data/sanitized_split.zip). |
|
|
|
- Reference Paper: [Intent Classification and Slot Filling for Privacy Policies](https://aclanthology.org/2021.acl-long.340/). |
|
- The back translation method (data augmentation) resulted in a 1% improvement in performance when applied to imbalanced samples |
|
- **F1 Score: 88** (%4 performance increase compared to original work) |
|
|
|
<b>5 Intents (Labels)</b>: |
|
|
|
*(1) Data Collection/Usage: What, why and how user information is collected;* |
|
|
|
*(2) Data Sharing/Disclosure: What, why and how user information is shared with or collected by third parties;* |
|
|
|
*(3) Data Storage/Retention: How long and where user information will be stored;* |
|
|
|
*(4) Data Security/Protection: Protection measures for user information;* |
|
|
|
*(5) Other: Other privacy practices that do not fall into the above four categories.* |
|
|
|
```python |
|
from transformers import pipeline |
|
|
|
pipe = pipeline("text-classification", "remzicam/privacy_intent") |
|
text="At any time during your use of the Services, you may decide to share some information or content publicly or privately." |
|
pipe(text) |
|
``` |
|
|
|
Output |
|
|
|
``` |
|
[{'label': 'data-sharing-disclosure', 'score': 0.8373807072639465}] |
|
|
|
``` |