Update README.md
README.md
CHANGED
@@ -26,7 +26,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the english only section of ai4privacy/pii-masking-65k dataset.
 
-Latest Model: [
+Latest Model: [Isotonic/mdeberta-v3-base_finetuned_ai4privacy_v2](https://huggingface.co/Isotonic/mdeberta-v3-base_finetuned_ai4privacy_v2)
 
 ## Useage
 GitHub Implementation: [Ai4Privacy](https://github.com/Sripaad/ai4privacy)
@@ -57,70 +57,12 @@ The following hyperparameters were used during training:
 
 ## Class wise metrics
 It achieves the following results on the evaluation set:
+
 - Loss: 0.0106
 - Overall Precision: 0.9760
 - Overall Recall: 0.9801
 - Overall F1: 0.9780
 - Overall Accuracy: 0.9977
-- Accountname F1: 1.0
-- Accountnumber F1: 1.0
-- Amount F1: 0.9565
-- Bic F1: 1.0
-- Bitcoinaddress F1: 1.0
-- Buildingnumber F1: 0.9753
-- City F1: 0.9987
-- Company Name F1: 1.0
-- County F1: 1.0
-- Creditcardcvv F1: 0.9701
-- Creditcardissuer F1: 0.9939
-- Creditcardnumber F1: 1.0
-- Currency F1: 0.8668
-- Currencycode F1: 0.8662
-- Currencyname F1: 0.7582
-- Currencysymbol F1: 0.36
-- Date F1: 0.9944
-- Displayname F1: 0.5970
-- Email F1: 1.0
-- Ethereumaddress F1: 1.0
-- Firstname F1: 0.9493
-- Fullname F1: 0.9982
-- Gender F1: 0.9524
-- Iban F1: 1.0
-- Ip F1: 0.5543
-- Ipv4 F1: 0.8700
-- Ipv6 F1: 0.8863
-- Jobarea F1: 0.9806
-- Jobdescriptor F1: 0.6875
-- Jobtitle F1: 0.9424
-- Jobtype F1: 0.8811
-- Lastname F1: 0.9052
-- Litecoinaddress F1: 0.9848
-- Mac F1: 1.0
-- Maskednumber F1: 1.0
-- Middlename F1: 0.7364
-- Name F1: 0.9994
-- Nearbygpscoordinate F1: 0.5
-- Number F1: 1.0
-- Password F1: 1.0
-- Phoneimei F1: 1.0
-- Phone Number F1: 1.0
-- Pin F1: 0.9697
-- Prefix F1: 0.9540
-- Secondaryaddress F1: 0.9947
-- Sex F1: 0.9650
-- Sextype F1: 0.0
-- Ssn F1: 1.0
-- State F1: 0.9965
-- Street F1: 0.9810
-- Streetaddress F1: 0.9832
-- Suffix F1: 0.7928
-- Time F1: 0.9880
-- Url F1: 0.9974
-- Useragent F1: 1.0
-- Username F1: 0.9746
-- Vehiclevin F1: 1.0
-- Vehiclevrm F1: 1.0
-- Zipcode F1: 0.9969
 
 ## Training results
 
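
Note on the overall metrics kept by this change: the README does not say how Overall F1 was computed. Assuming it is the usual harmonic mean of the overall precision and recall, the retained figures can be cross-checked with a small sketch. The helper name `f1_score` below is illustrative and is not taken from the repository.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (assumed definition of F1)."""
    if precision + recall == 0:
        # Avoid division by zero when nothing was predicted or matched.
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Overall Precision and Overall Recall as reported in the README.
overall_f1 = f1_score(0.9760, 0.9801)
print(f"{overall_f1:.4f}")  # prints 0.9780
```

Under that assumption the result agrees with the reported Overall F1 of 0.9780 to four decimal places.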