Image-to-text models
Collection of image captioning models
Salesforce/blip-image-captioning-large Image-to-Text • 0.5B • Updated Feb 3 • 1.71M • 1.39k
microsoft/git-large-coco Image-to-Text • 0.4B • Updated Jun 26, 2023 • 3.67k • 104
Salesforce/instructblip-vicuna-7b Image-Text-to-Text • 8B • Updated Feb 3 • 83.2k • 95
Salesforce/blip2-flan-t5-xxl Image-Text-to-Text • 12B • Updated Feb 3 • 6.97k • 93
SigLIP release
SigLIP improves upon CLIP with a sigmoid loss. Both English-only and multilingual checkpoints are released.
Sigmoid Loss for Language Image Pre-Training Paper • 2303.15343 • Published Mar 27, 2023 • 8
google/siglip-base-patch16-224 Zero-Shot Image Classification • 0.2B • Updated Sep 26, 2024 • 288k • 60
google/siglip-base-patch16-256 Zero-Shot Image Classification • 0.2B • Updated Sep 26, 2024 • 9.31k • 6
google/siglip-base-patch16-384 Zero-Shot Image Classification • 0.2B • Updated Sep 26, 2024 • 4.34k • 10
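The sigmoid loss mentioned in the SigLIP description can be illustrated with a small sketch. It scores every image-text pair in a batch independently as a binary match or non-match, rather than normalizing over the batch with a softmax as CLIP does. The function names below are hypothetical, and the default temperature (10) and bias (-10) are assumed to follow the initial values reported in the paper; this is a plain-Python illustration, not the released training code.

```python
import math

def log_sigmoid(x):
    # Numerically stable log(sigmoid(x)).
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))

def l2_normalize(v):
    # Scale a vector to unit length so dot products are cosine similarities.
    norm = math.sqrt(sum(c * c for c in v))
    return [c / norm for c in v]

def sigmoid_pair_loss(image_embs, text_embs, temperature=10.0, bias=-10.0):
    # Assumed defaults: temperature and bias mirror the paper's initial values.
    imgs = [l2_normalize(v) for v in image_embs]
    txts = [l2_normalize(v) for v in text_embs]
    n = len(imgs)
    total = 0.0
    for i in range(n):
        for j in range(n):
            similarity = sum(a * b for a, b in zip(imgs[i], txts[j]))
            logit = temperature * similarity + bias
            # Matching pairs (i == j) are labeled +1, all other pairs -1.
            label = 1.0 if i == j else -1.0
            total += -log_sigmoid(label * logit)
    # Average over the batch size, summing over all pairs per image.
    return total / n

# Toy batch: two orthogonal embeddings, first correctly paired, then swapped.
aligned = sigmoid_pair_loss([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]])
swapped = sigmoid_pair_loss([[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]])
```

With this toy batch, the correctly paired embeddings yield a much lower loss than the swapped ones, which is the behavior the loss is meant to reward.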
nielsr/metaclip-2-huge-worldwide-378 Zero-Shot Image Classification • 2B • Updated 11 days ago • 28 • 1