Captions Are Worth a Thousand Words: Enhancing Product Retrieval with Pretrained Image-to-Text Models Paper • 2402.08532 • Published Feb 13, 2024
Shopping Queries Image Dataset (SQID): An Image-Enriched ESCI Dataset for Exploring Multimodal Learning in Product Search Paper • 2405.15190 • Published May 24, 2024