
bert-phishing-classifier_student

This model is a modified version of distilbert/distilbert-base-uncased, trained via knowledge distillation from shawhin/bert-phishing-classifier_teacher using the shawhin/phishing-site-classification dataset. It achieves the following results on the test set:

  • Loss (training): 0.0563
  • Accuracy: 0.9022
  • Precision: 0.9426
  • Recall: 0.8603
  • F1 Score: 0.8995

Model description

Student model for a knowledge distillation example.

Video | Blog | Example code

Intended uses & limitations

This model was created for educational purposes.
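
For example, the model can be loaded with the transformers text-classification pipeline. This is a minimal sketch: the example URL is made up, and the output label names come from the model's config.

```python
# Minimal inference sketch using the transformers pipeline.
from transformers import pipeline

classifier = pipeline(
    "text-classification",
    model="shawhin/bert-phishing-classifier_student",
)

# The input is a site URL; the returned label names are defined by the
# model's config. The URL below is a hypothetical example.
print(classifier("http://secure-login.example-bank.com/verify"))
```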

Training and evaluation data

The training, testing, and validation data are available here: shawhin/phishing-site-classification.
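
A minimal sketch for loading the splits with the datasets library (the split layout is assumed from the card's wording):

```python
# Sketch: load the dataset with the Hugging Face datasets library.
from datasets import load_dataset

data = load_dataset("shawhin/phishing-site-classification")
print(data)  # prints the available splits and their sizes
```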

Training procedure

Training hyperparameters

The following hyperparameters were used during training; a sketch of the corresponding distillation loss follows the list:

  • learning_rate: 0.0001
  • train_batch_size: 32
  • eval_batch_size: 32
  • num_epochs: 5
  • temperature: 2.0
  • alpha (distillation loss weight): 0.5
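
Below is an illustrative sketch of a distillation loss that uses the temperature and alpha values above. It is a reconstruction under standard knowledge-distillation assumptions, not the exact training code (see the linked example code for that).

```python
# Illustrative knowledge-distillation loss: a temperature-scaled
# soft-target term blended with an ordinary hard-label term.
# temperature and alpha match the hyperparameters listed above;
# the rest is a hypothetical reconstruction of the training setup.
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=2.0, alpha=0.5):
    # Soft-target term: KL divergence between the temperature-scaled
    # teacher and student distributions, scaled by T^2 as is standard.
    soft_targets = F.softmax(teacher_logits / temperature, dim=-1)
    log_student = F.log_softmax(student_logits / temperature, dim=-1)
    kd_loss = F.kl_div(log_student, soft_targets, reduction="batchmean")
    kd_loss = kd_loss * temperature ** 2

    # Hard-label term: cross-entropy against the true labels.
    ce_loss = F.cross_entropy(student_logits, labels)

    # alpha weights the distillation term against the hard-label term.
    return alpha * kd_loss + (1 - alpha) * ce_loss
```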
Model size: 52.8M parameters (F32 tensors, safetensors format)
