metadata
tags:
- timm
- feature-extraction
- image-classification
library_name: timm
license: other
license_name: kaiko-non-commercial
license_link: >-
https://github.com/kaiko-ai/towards_large_pathology_fms/blob/a62a0c54719d858371aefa0fcab6ec4b34c86c4c/LICENSE
metrics:
- accuracy
model-index:
- name: kaiko
results:
- task:
type: image-classification
name: Image Classification
dataset:
name: BACH
type: image-classification
metrics:
- type: accuracy
value: 0.87
name: Accuracy
verified: false
- task:
type: image-classification
name: Image Classification
dataset:
name: CRC-NCT-HE
type: image-classification
metrics:
- type: accuracy
value: 0.93
name: Accuracy
verified: false
- task:
type: image-classification
name: Image Classification
dataset:
name: MHIST
type: image-classification
metrics:
- type: accuracy
value: 0.809
name: Accuracy
verified: false
- task:
type: image-classification
name: Image Classification
dataset:
name: PCam
type: image-classification
metrics:
- type: accuracy
value: 0.898
name: Accuracy
verified: false
- task:
type: image-classification
name: Image Classification
dataset:
name: TP53
type: image-classification
metrics:
- type: accuracy
value: 0.656
name: Accuracy
verified: false
- task:
type: image-classification
name: Image Classification
dataset:
name: CoNSeP
type: image-classification
metrics:
- type: accuracy
value: 0.679
name: Accuracy
verified: false
Model card for vit_large_patch14_reg4_224.kaiko_ai_towards_large_pathology_fms
Model Details
- Model Type: Feature backbone
- Model Stats:
- Params: 304M (large)
- Image size: 224 x 224 x 3
- Patch size: 14 x 14 x 3
- Repository: github.com:kaiko-ai/towards_large_pathology_fms
- Original Weights: github.com:kaiko-ai/towards_large_pathology_fms/0.0.1
- Papers:
Model Usage
Image Embeddings
from torchvision.transforms import v2
from PIL import Image
import requests
import torch
import timm
import io
# get example histology image
url = "https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcQc7_xZpGOfQT7sxKwf2w5lL4GAq6IX_CbTzP1NGeenzA&s"
image = Image.open(io.BytesIO(requests.get(url).content))
# load model from the hub
model = timm.create_model(
model_name="hf-hub:1aurent/vit_large_patch14_reg4_224.kaiko_ai_towards_large_pathology_fms",
dynamic_img_size=True,
pretrained=True,
).eval()
# get image transform
preprocessing = v2.Compose(
[
v2.ToImage(),
v2.Resize(size=224),
v2.CenterCrop(size=224),
v2.ToDtype(torch.float32, scale=True),
v2.Normalize(
mean=(0.5, 0.5, 0.5),
std=(0.5, 0.5, 0.5),
),
]
)
data = preprocessing(image).unsqueeze(0) # input is a (batch_size, num_channels, img_size, img_size) shaped tensor
output = model(data) # output is a (batch_size, num_features) shaped tensor
Citation
@misc{ai2024largescale,
title = {Towards Large-Scale Training of Pathology Foundation Models},
author = {kaiko.ai and Nanne Aben and Edwin D. de Jong and Ioannis Gatopoulos and Nicolas Känzig and Mikhail Karasikov and Axel Lagré and Roman Moser and Joost van Doorn and Fei Tang},
year = {2024},
eprint = {2404.15217},
archivePrefix = {arXiv},
primaryClass = {cs.CV}
}