File size: 1,517 Bytes
b7a7f3a
 
 
 
 
 
 
 
 
 
1136ea3
 
 
 
4ad1b0d
1136ea3
5089e26
 
 
1136ea3
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
---
license: other
language:
- en
pipeline_tag: text-classification
tags:
- legal
- privacy
- intent
- privacy policies
---

# privacy_intent for privacy policy intent classification

This model is fine-tuned version of [mukund/privbert](https://huggingface.co/mukund/privbert) model on [PolicyIE dataset ](https://github.com/wasiahmad/PolicyIE/blob/main/data/sanitized_split.zip).

- Reference Paper: [Intent Classification and Slot Filling for Privacy Policies](https://aclanthology.org/2021.acl-long.340/).
- The back translation method (data augmentation) resulted in a 1% improvement in performance when applied to imbalanced samples
- **F1 Score: 88** (%4 performance increase compared to original work)
  
<b>5 Intents (Labels)</b>: 

*(1) Data Collection/Usage: What, why and how user information is collected;*

*(2) Data Sharing/Disclosure: What, why and how user information is shared with or collected by third parties;*

*(3) Data Storage/Retention: How long and where user information will be stored;*

*(4) Data Security/Protection: Protection measures for user information;*

*(5) Other: Other privacy practices that do not fall into the above four categories.*

```python
from transformers import pipeline

pipe = pipeline("text-classification", "remzicam/privacy_intent")
text="At any time during your use of the Services, you may decide to share some information or content publicly or privately."
pipe(text)
```

Output

```
[{'label': 'data-sharing-disclosure', 'score': 0.8373807072639465}]

```