metadata

library_name: transformers
tags:
  - generated_from_trainer
model-index:
  - name: SkLIP-masked
    results: []
license: mit
datasets:
  - joshuachou/SkinCAP
language:
  - en
base_model:
  - allenai/scibert_scivocab_uncased
  - openai/clip-vit-base-patch32
pipeline_tag: feature-extraction

SkLIP

SkLIP (Skin CLIP) is a hybrid CLIP model fine-tuned on SkinCAP, a multimodal dermatology dataset annotated with rich medical captions. It pairs a SciBERT text encoder (allenai/scibert_scivocab_uncased) with the pre-trained CLIP ViT-B/32 vision encoder (openai/clip-vit-base-patch32).
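This card does not publish loading code, so the following is only a sketch of how a SciBERT-plus-CLIP dual encoder of this kind could be assembled with the `VisionTextDualEncoderModel` class in Transformers. Using that class is an assumption, not a confirmed detail of SkLIP, and the helper name `build_hybrid_clip` is hypothetical. The function returns the two pre-trained encoders with freshly initialized projection layers, which is a starting point for fine-tuning rather than the SkLIP weights themselves.

```python
# Checkpoints listed under base_model in this card's metadata.
VISION_CKPT = "openai/clip-vit-base-patch32"
TEXT_CKPT = "allenai/scibert_scivocab_uncased"


def build_hybrid_clip(vision_ckpt=VISION_CKPT, text_ckpt=TEXT_CKPT):
    """Assemble a CLIP-style dual encoder (hypothetical helper, not SkLIP's own code)."""
    # Imported lazily so this module can be read or imported without
    # pulling in the heavy dependencies.
    from transformers import (
        AutoImageProcessor,
        AutoTokenizer,
        VisionTextDualEncoderModel,
        VisionTextDualEncoderProcessor,
    )

    # Loads both pre-trained encoders. The projection layers that map each
    # encoder into the shared embedding space are randomly initialized here
    # and would still need contrastive fine-tuning.
    model = VisionTextDualEncoderModel.from_vision_text_pretrained(
        vision_ckpt, text_ckpt
    )

    # Bundle CLIP image preprocessing with the SciBERT tokenizer.
    processor = VisionTextDualEncoderProcessor(
        image_processor=AutoImageProcessor.from_pretrained(vision_ckpt),
        tokenizer=AutoTokenizer.from_pretrained(text_ckpt),
    )
    return model, processor
```

Calling `model, processor = build_hybrid_clip()` downloads both checkpoints. On a model built this way, `model.get_image_features(...)` and `model.get_text_features(...)` return the projected embeddings that a feature-extraction workflow would consume.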

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

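The model was fine-tuned on SkinCAP (joshuachou/SkinCAP). More information needed on how the training and evaluation splits were constructed.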

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 1e-05
train_batch_size: 64
eval_batch_size: 64
seed: 42
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: cosine
lr_scheduler_warmup_ratio: 0.01
num_epochs: 10
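To make the scheduler settings concrete, here is a minimal sketch of a cosine schedule with linear warmup, using the learning rate and warmup ratio listed above and the 570 total steps reported in the training results. It assumes the warmup length is the ratio times the total step count, rounded up, and that the decay is a single half-cosine down to zero; the function name `lr_at` is illustrative.

```python
import math

BASE_LR = 1e-5        # learning_rate
WARMUP_RATIO = 0.01   # lr_scheduler_warmup_ratio
TOTAL_STEPS = 570     # final step in the training results (10 epochs x 57 steps)

# Assumption: warmup steps = ceil(total steps * warmup ratio).
WARMUP_STEPS = math.ceil(TOTAL_STEPS * WARMUP_RATIO)


def lr_at(step):
    """Learning rate at an optimizer step: linear warmup, then half-cosine decay."""
    if step < WARMUP_STEPS:
        return BASE_LR * step / max(1, WARMUP_STEPS)
    progress = (step - WARMUP_STEPS) / max(1, TOTAL_STEPS - WARMUP_STEPS)
    return BASE_LR * 0.5 * (1.0 + math.cos(math.pi * progress))


# Print the rate at the start, at the end of warmup, mid-training, and at the end.
for step in (0, WARMUP_STEPS, 285, TOTAL_STEPS):
    print(f"step {step:3d}: lr = {lr_at(step):.3e}")
```

With these numbers the warmup covers only the first 6 steps, so nearly the whole run is spent on the cosine decay from 1e-05 toward zero.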

Training results

Training Loss	Epoch	Step	Validation Loss
4.2018	1.0	57	4.1344
4.1697	2.0	114	4.1298
4.1668	3.0	171	4.1276
4.1640	4.0	228	4.1263
4.1580	5.0	285	4.1253
4.1583	6.0	342	4.1246
4.1569	7.0	399	4.1243
4.1575	8.0	456	4.1241
4.1564	9.0	513	4.1240
4.1604	10.0	570	4.1240
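As a reading aid for the loss values, the sketch below computes the chance-level value of a symmetric in-batch contrastive loss, which equals ln(N) for a batch of N image-text pairs. That SkLIP was trained with this standard CLIP-style objective is an assumption, since the card does not state the loss function; the function name `symmetric_contrastive_loss` is illustrative.

```python
import math


def symmetric_contrastive_loss(logits):
    """CLIP-style loss for a square matrix logits[i][j] = sim(image i, text j)."""
    n = len(logits)

    def cross_entropy_diag(rows):
        # Mean cross-entropy where the correct class for row i is column i.
        total = 0.0
        for i, row in enumerate(rows):
            m = max(row)  # subtract the max for numerical stability
            log_z = m + math.log(sum(math.exp(x - m) for x in row))
            total += log_z - row[i]
        return total / n

    # Image-to-text uses the rows, text-to-image uses the columns.
    columns = [list(col) for col in zip(*logits)]
    return 0.5 * (cross_entropy_diag(logits) + cross_entropy_diag(columns))


BATCH_SIZE = 64  # train_batch_size and eval_batch_size in this card

# All-equal logits model an encoder pair that cannot tell any pair apart.
uninformative = [[0.0] * BATCH_SIZE for _ in range(BATCH_SIZE)]
print(f"{symmetric_contrastive_loss(uninformative):.4f}")  # prints 4.1589
```

Under that assumption, a model with no ability to match images to captions scores about 4.1589 on a full batch of 64, which gives a reference point for the values in the table.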

Framework versions

Transformers 4.45.2
PyTorch 2.1.0+cu118
Datasets 3.0.1
Tokenizers 0.20.1