Is this model the image-text retrieval model (BLIP-2 ViT-g) fine-tuned on the COCO dataset ？

by XiangLiu03 - opened 29 days ago

29 days ago

"Hello, may I ask if this model is the image-text retrieval model (BLIP-2 ViT-g) fine-tuned on the COCO dataset as mentioned in the BLIP-2 paper?"

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment