|
|
|
--- |
|
language: en |
|
license: mit |
|
tags: |
|
- VIT |
|
- image-classification |
|
- drowsiness-detection |
|
--- |
|
|
|
# VIT_Drowsiness_2 |
|
|
|
This model is a fine-tuned version of [google/vit-base-patch16-224](https://huggingface.co/google/vit-base-patch16-224) for drowsiness detection. |
|
|
|
## Model description |
|
|
|
This model is a Vision Transformer (ViT) fine-tuned for drowsiness detection. It classifies images into two categories: drowsy and not drowsy. |
|
|
|
## Intended uses & limitations |
|
|
|
This model is intended for drowsiness detection in images. It should be used on facial images similar to those in the training dataset. |
|
|
|
## Training data |
|
|
|
The model was trained on a custom dataset located at /kaggle/input/nthuddd2/train_data. The dataset was split into 70% training, 15% validation, and 15% test sets. |
|
|
|
## Training procedure |
|
|
|
The model was trained for 10 epochs using the Lion optimizer with a learning rate of 0.0001 and weight decay of 0.01. A cosine learning rate scheduler with 0.1 warmup ratio was used. |
|
|
|
## Evaluation results |
|
|
|
[Add your evaluation results here after training] |
|
|