---
language: en
thumbnail: https://huggingface.co/front/thumbnails/google.png
license: apache-2.0
base_model:
- cross-encoder/ms-marco-TinyBERT-L-2-v2
pipeline_tag: text-classification
library_name: transformers
metrics:
- f1
- precision
- recall
datasets:
- Mozilla/autofill_dataset
---
## Cross-Encoder for MS MARCO with TinyBERT
This model is a fine-tuned version of the [cross-encoder/ms-marco-TinyBERT-L-2](https://huggingface.co/cross-encoder/ms-marco-TinyBERT-L-2) checkpoint.
It was fine-tuned on HTML tags and labels generated using [Fathom](https://mozilla.github.io/fathom/commands/label.html).
## How to use this model in `transformers`
```python
from transformers import pipeline

classifier = pipeline(
    "text-classification",
    model="Mozilla/tinybert-uncased-autofill"
)

print(
    classifier('<input class="cc-number" placeholder="Enter credit card number..." />')
)
```
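When the pipeline is called with a list of strings, it returns one dictionary per input with a `label` and a `score`. The following is a minimal sketch of how a caller might discard low-confidence predictions. The helper name `classify_fields`, the 0.5 threshold, and the choice of `"Other"` as the fallback label are illustrative assumptions, not part of this model's API.

```python
from typing import Callable, Dict, List

# Hypothetical fallback label and threshold; neither is specified by the model card.
FALLBACK_LABEL = "Other"
MIN_SCORE = 0.5


def classify_fields(
    classifier: Callable[[List[str]], List[Dict[str, object]]],
    html_snippets: List[str],
    min_score: float = MIN_SCORE,
) -> List[str]:
    """Return one label per snippet, using FALLBACK_LABEL for low-confidence predictions."""
    predictions = classifier(html_snippets)
    labels = []
    for prediction in predictions:
        if prediction["score"] >= min_score:
            labels.append(prediction["label"])
        else:
            # The top prediction is too uncertain, so fall back to the catch-all label.
            labels.append(FALLBACK_LABEL)
    return labels
```

With the `classifier` object created above, a call would look like `classify_fields(classifier, ['<input type="email" name="email" />'])`. The threshold should be tuned on your own data rather than taken from this sketch.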
## Model Training Info
```python
# Hyperparameters used for fine-tuning
hyperparameters = {
    'learning_rate': 0.000082,
    'num_train_epochs': 71,
    'weight_decay': 0.1,
    'per_device_train_batch_size': 32,
}
```
More information on how the model was trained can be found here: https://github.com/mozilla/smart_autofill
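
The hyperparameter names listed above match keyword arguments of `transformers.TrainingArguments`, so one plausible way to reproduce the configuration is sketched below. This is an assumption about how the values could be used, not the actual training script (see the linked repository for that). The `output_dir` value and the helper name `build_training_arguments` are hypothetical.

```python
# Values copied from the hyperparameters listed in this section.
HYPERPARAMETERS = {
    "learning_rate": 0.000082,
    "num_train_epochs": 71,
    "weight_decay": 0.1,
    "per_device_train_batch_size": 32,
}


def build_training_arguments(output_dir: str = "autofill-checkpoints"):
    """Build a TrainingArguments object from HYPERPARAMETERS.

    output_dir is a hypothetical path, not taken from the original training run.
    """
    # Imported lazily so the dictionary can be inspected without transformers installed.
    from transformers import TrainingArguments

    return TrainingArguments(output_dir=output_dir, **HYPERPARAMETERS)
```

The returned object can be passed to a `Trainer` together with a model and datasets. Depending on the installed `transformers` version, additional packages such as `accelerate` may be required.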
## Model Performance
```
Test Performance:
Precision: 0.9653
Recall: 0.9648
F1: 0.9644

                     precision    recall  f1-score   support

      CC Expiration      1.000     0.625     0.769        16
CC Expiration Month      0.919     0.944     0.932        36
 CC Expiration Year      0.897     0.946     0.921        37
            CC Name      0.938     0.968     0.952        31
          CC Number      0.926     1.000     0.962        50
    CC Payment Type      0.903     0.867     0.884        75
   CC Security Code      0.975     0.951     0.963        41
            CC Type      0.917     0.786     0.846        14
   Confirm Password      0.911     0.895     0.903        57
              Email      0.933     0.959     0.946        73
         First Name      0.833     1.000     0.909         5
               Form      0.974     0.974     0.974        39
          Last Name      0.667     0.800     0.727         5
       New Password      0.929     0.938     0.933        97
              Other      0.985     0.985     0.985      1235
              Phone      1.000     0.667     0.800         3
           Zip Code      0.909     0.938     0.923        32

           accuracy                          0.965      1846
          macro avg      0.919     0.897     0.902      1846
       weighted avg      0.965     0.965     0.964      1846
```
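
The macro and weighted averages in the report differ because the test set is dominated by the `Other` class (1235 of 1846 examples). As a rough illustration, the sketch below recomputes both F1 averages from the per-class rows. The function names are illustrative, and because the inputs are already rounded to three decimals, the results only approximate the reported figures.

```python
# (f1-score, support) pairs copied from the per-class rows of the report above.
PER_CLASS = {
    "CC Expiration": (0.769, 16),
    "CC Expiration Month": (0.932, 36),
    "CC Expiration Year": (0.921, 37),
    "CC Name": (0.952, 31),
    "CC Number": (0.962, 50),
    "CC Payment Type": (0.884, 75),
    "CC Security Code": (0.963, 41),
    "CC Type": (0.846, 14),
    "Confirm Password": (0.903, 57),
    "Email": (0.946, 73),
    "First Name": (0.909, 5),
    "Form": (0.974, 39),
    "Last Name": (0.727, 5),
    "New Password": (0.933, 97),
    "Other": (0.985, 1235),
    "Phone": (0.800, 3),
    "Zip Code": (0.923, 32),
}


def macro_f1(rows):
    """Unweighted mean of per-class F1: every class counts equally."""
    return sum(f1 for f1, _ in rows.values()) / len(rows)


def weighted_f1(rows):
    """Support-weighted mean of per-class F1: large classes dominate."""
    total_support = sum(support for _, support in rows.values())
    return sum(f1 * support for f1, support in rows.values()) / total_support


print(f"macro F1:    {macro_f1(PER_CLASS):.3f}")
print(f"weighted F1: {weighted_f1(PER_CLASS):.3f}")
```

The macro value comes out at about 0.902, matching the report, while the weighted value lands close to the reported 0.964, with the small gap attributable to rounding in the per-class inputs. Small classes such as `Phone` (3 examples) and `Last Name` (5 examples) pull the macro average down but barely affect the weighted one.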