Papers
arxiv:2112.12777

Cross Modal Retrieval with Querybank Normalisation

Published on Dec 23, 2021
Authors:
,
,

Abstract

Profiting from large-scale training datasets, advances in neural architecture design and efficient inference, joint embeddings have become the dominant approach for tackling cross-modal retrieval. In this work we first show that, despite their effectiveness, state-of-the-art joint embeddings suffer significantly from the longstanding "hubness problem" in which a small number of gallery embeddings form the nearest neighbours of many queries. Drawing inspiration from the NLP literature, we formulate a simple but effective framework called Querybank Normalisation (QB-Norm) that re-normalises query similarities to account for hubs in the embedding space. QB-Norm improves retrieval performance without requiring retraining. Differently from prior work, we show that QB-Norm works effectively without concurrent access to any test set queries. Within the QB-Norm framework, we also propose a novel similarity normalisation method, the Dynamic Inverted Softmax, that is significantly more robust than existing approaches. We showcase QB-Norm across a range of cross modal retrieval models and benchmarks where it consistently enhances strong baselines beyond the state of the art. Code is available at https://vladbogo.github.io/QB-Norm/.

Community

Paper author

@librarian-bot recommend

This is an automated message from the Librarian Bot. I found the following papers similar to this paper.

The following papers were recommended by the Semantic Scholar API

Please give a thumbs up to this comment if you found it helpful!

If you want recommendations for any Paper on Hugging Face checkout this Space

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend

@librarian-bot recommend

Paper author

@davanstrien I noticed that the @librarian-bot didn't respond to the second request for recommendations. I suspect this happens because there was a response already.

If this is the case, I think ideally it should respond with a link to the other message. I would love to contribute to the project and tackle this case. I would really appreciate if you could point me to the repo, since I wasn't able to find it.

Can you find papers on the early detection of Sepsis? @librarian-bot recommend

@vladbogo

@davanstrien I noticed that the @librarian-bot didn't respond to the second request for recommendations. I suspect this happens because there was a response already.

If this is the case, I think ideally it should respond with a link to the other message. I would love to contribute to the project and tackle this case. I would really appreciate if you could point me to the repo, since I wasn't able to find it.

Most of the code is here: https://huggingface.co/spaces/librarian-bots/recommend_similar_papers

I would prefer to wait with this for a bit before getting a librarian-bot to respond with more comments since I prefer librarian-bot not to be too noisy on papers. We might add threading to paper comments soon, which would then make it possible to reply to a user in a separate thread. At that point, I think it would make sense to allow multiple requests to librarian-bot to account for new papers coming out + to give a user a better message about what's not working. I will add some clearer wording to the comments to make it clearer that librarian-bot will currently only make one comment per paper.

cc @julien-c @pierric

This comment has been hidden
Paper author

@davanstrien makes sense! Great to hear that there will be threading to paper comments soon. Keep up the good work!

P.S. if there are some public tasks/issues related to this project, please let me know.

Sign up or log in to comment

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2112.12777 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2112.12777 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2112.12777 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.