arxiv:2312.01711

Regressor-Segmenter Mutual Prompt Learning for Crowd Counting

Published on Dec 4, 2023

Authors:

Zhaoyi Yan, Binghui Chen

Abstract

Crowd counting has achieved significant progress by training regressors to predict instance positions. In heavily crowded scenes, however, regressors are challenged by uncontrollable annotation variance, which biases the density map and makes context information inaccurate. In this study, we propose mutual prompt learning (mPrompt), which uses a regressor and a segmenter as guidance for each other, addressing the bias and inaccuracy caused by annotation variance while distinguishing foreground from background. Specifically, mPrompt uses point annotations to tune the segmenter and predict pseudo head masks, which serves as point prompt learning. It then uses the predicted segmentation masks as a spatial constraint to rectify biased point annotations, which serves as context prompt learning. mPrompt defines a form of mutual information maximization through prompt learning, mitigating the impact of annotation variance while improving model accuracy. Experiments show that mPrompt significantly reduces the Mean Absolute Error (MAE), demonstrating its potential as a general framework for downstream vision tasks.
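The quantities named in the abstract can be illustrated with a minimal sketch. It is not the mPrompt implementation: it only shows how a count is read from a density map (the sum of its values), how a binary foreground mask could act as a spatial constraint, and how MAE (the mean absolute error between predicted and ground-truth counts) is computed. The function names are hypothetical, and the element-wise masking is an assumption made for illustration, not the paper's rectification procedure.

```python
from typing import Sequence

Grid = Sequence[Sequence[float]]


def count_from_density(density: Grid) -> float:
    """Estimated crowd count: the sum of all values in a density map."""
    return sum(sum(row) for row in density)


def apply_mask(density: Grid, mask: Grid) -> list:
    """Zero out density outside the foreground mask.

    Assumption for illustration: mask values are 1 for head regions
    and 0 for background, and the constraint is element-wise.
    """
    return [
        [d * m for d, m in zip(d_row, m_row)]
        for d_row, m_row in zip(density, mask)
    ]


def mean_absolute_error(predicted: Sequence[float], actual: Sequence[float]) -> float:
    """MAE over a set of images: the average of |predicted - actual| counts."""
    if not predicted or len(predicted) != len(actual):
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(predicted)


# Toy 2x2 density map and mask (made-up values, not from the paper).
density = [[0.5, 0.5], [0.25, 0.75]]
mask = [[1, 1], [0, 1]]

masked_count = count_from_density(apply_mask(density, mask))
mae = mean_absolute_error([masked_count, 3.0], [2.0, 3.0])
```

In this toy case, masking removes the 0.25 that falls on background, so the estimated count drops from 2.0 to 1.75.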
