Model Card for e1010101/vit-384-tongue-image

A fine-tune of Google's ViT-Base model (patch size 16, 384×384 input) for multi-label image classification of tongue images.

Model Details

Model Description

The model predicts the presence or absence of three features: Cracks, Red Dots, and Toothmarks.

Model type: Vision Transformer
Finetuned from model: https://huggingface.co/google/vit-base-patch16-384

Weights format: Safetensors
Model size: 86.1M params
Tensor type: F32
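
Because the task is multi-label, each of the three features is scored independently rather than competing in a softmax. The following is a minimal sketch of how inference might look, not the author's confirmed pipeline. It assumes that the checkpoint loads through the generic `transformers` auto classes, that the repository ships a preprocessor config, that the label names are stored in `id2label`, and that a sigmoid with a 0.5 threshold is an acceptable decision rule. The helper names `decode_multilabel` and `predict` are hypothetical.

```python
import math

# Repository name as shown on the model page.
MODEL_ID = "e1010101/vit-384-tongue-image"


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def decode_multilabel(logits, labels, threshold=0.5):
    """Turn one logit per label into an independent probability and a yes/no flag.

    Assumption: a 0.5 threshold; the model card does not state one.
    """
    results = {}
    for label, logit in zip(labels, logits):
        prob = sigmoid(logit)
        results[label] = {"probability": prob, "present": prob >= threshold}
    return results


def predict(image_path, threshold=0.5):
    """Run the fine-tuned checkpoint on one image (downloads weights on first use)."""
    # Heavy imports stay local so decode_multilabel works without them installed.
    import torch
    from PIL import Image
    from transformers import AutoImageProcessor, AutoModelForImageClassification

    # Assumption: the repository provides both a preprocessor config and weights.
    processor = AutoImageProcessor.from_pretrained(MODEL_ID)
    model = AutoModelForImageClassification.from_pretrained(MODEL_ID)
    model.eval()

    image = Image.open(image_path).convert("RGB")
    inputs = processor(images=image, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits[0].tolist()

    # Assumption: id2label holds the three feature names in output order.
    labels = [model.config.id2label[i] for i in range(len(logits))]
    return decode_multilabel(logits, labels, threshold)
```

Calling `predict("tongue.jpg")` (a hypothetical file path) would return a dictionary keyed by label name. The threshold can be raised or lowered per use case, since the three features are scored independently.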

