Is this model the image-text retrieval model (BLIP-2 ViT-g) fine-tuned on the COCO dataset?

#2
by XiangLiu03 - opened

"Hello, may I ask if this model is the image-text retrieval model (BLIP-2 ViT-g) fine-tuned on the COCO dataset as mentioned in the BLIP-2 paper?"

[Attached screenshot: Snipaste_2024-10-09_14-08-47.png]
