ABSTRACT Pseudo-relevance feedback (PRF) is a relevance feedback (RF) technique for information retrieval that treats the top k retrieved images as relevance feedback. PRF is used to avoid the limitations of the traditional RF approach, which is a human-in-the-loop process. Although the pseudo-relevance feedback set contains noise, PRF can perform retrieval reasonably effectively. For implementing PRF, the Rocchio algorithm has been considered reasonably effective and is a well-established baseline method. However, it simply treats all of the top k feedback images as being equally similar to the query. Therefore, we present a block-based PRF approach for improving image retrieval performance. In this approach, images in the positive and negative feedback sets are further divided into predefined blocks, each of which contains one to several images, and blocks containing higher- or lower-ranked images will be assigned higher or lower weights, respectively. Experiments using the NUS-WIDE-LITE and Caltech 256 datasets and two different feature representations consistently show that the proposed approach using 30 blocks outperforms the baseline PRF in terms of P@10, P@20, and P@50. Furthermore, we show that a system that incorporates the user’s feedback allows the 30-block-based PRF approach to perform even better.
Read full abstract