I know there is single-modality support for text (e.g. the text-to-text retrieval example). Is this also true for image-to-text retrieval, or for image-to-image retrieval? Thanks.