The effectiveness of a Content-based Image Retrieval (CBIR) system depends on selecting representative and salient visual features. Capturing the salient image features for forming a feature vector is an essential part of the CBIR process, as it can significantly affect retrieval efficiency. Most images contain a visually prominent region that perceives maximum image semantic information, known as Regions-of-Interest (ROIs). This paper suggests a new CBIR scheme that employs a Graph-based Visual Saliency (GBVS) map to extract the ROI of the image. Further, block-level Discrete Cosine Transforms (DCT) and subsequent histograms are computed for the DC and significant AC coefficients of the extracted ROI image. In addition, a low-dimensional feature vector is formed from the modified histograms. Subsequently, six statistical features from the background scene have also been computed for the final feature vector construction. As a result, the final feature vector contains ROI and background information in their proper proportion to improve retrieval performance. Moreover, the performance of the proposed CBIR scheme has been evaluated using one medical, three natural, two objects, one texture, and one natural heterogeneous image database. All these databases collectively contain 53732 different images from 200 image classes. The outcomes of the experiments demonstrate substantial improvement in the retrieval process with some existing related schemes. At the same time, it also shows that the performance of the proposed scheme is robust concerning all kinds of image databases.
Read full abstract