Image-to-text models Collection of image captioning models Salesforce/blip-image-captioning-large Image-to-Text • Updated Dec 7, 2023 • 1.75M • • 1.23k microsoft/git-large-coco Image-to-Text • Updated Jun 26, 2023 • 6.84k • • 101 Salesforce/instructblip-vicuna-7b Image-Text-to-Text • Updated Nov 21 • 253k • 85 Salesforce/blip2-flan-t5-xxl Image-Text-to-Text • Updated Nov 21 • 9.62k • 85
SigLIP release SigLIP improves upon CLIP with a sigmoid loss. Both English-only and multilingual checkpoints are released. Sigmoid Loss for Language Image Pre-Training Paper • 2303.15343 • Published Mar 27, 2023 • 5 google/siglip-base-patch16-224 Zero-Shot Image Classification • Updated Sep 26 • 745k • 32 google/siglip-base-patch16-256 Zero-Shot Image Classification • Updated Sep 26 • 3.52k • 5 google/siglip-base-patch16-384 Zero-Shot Image Classification • Updated Sep 26 • 244k • 9
nielsr/dinov2-with-registers-giant-imagenet1k-1-layer Image Classification • Updated about 18 hours ago • 3
nielsr/dinov2-with-registers-base-imagenet1k-1-layer Image Classification • Updated about 18 hours ago • 4
nielsr/dinov2-with-registers-large-imagenet1k-1-layer Image Classification • Updated about 18 hours ago • 3
nielsr/dinov2-with-registers-small-imagenet1k-1-layer Image Classification • Updated about 18 hours ago • 6