Model Card for Model ID
A fine-tune of Google's ViT-384 model for multi-label image classification on tongue images.
Model Details
Model Description
The model will predict the presence/absence of three features; Cracks, Red Dots and Toothmarks.
- Model type: Vision Transformer
- Finetuned from model [optional]: https://huggingface.co/google/vit-base-patch16-384
- Downloads last month
- 224
Inference Providers
NEW
This model is not currently available via any of the supported third-party Inference Providers, and
the model is not deployed on the HF Inference API.