Research in content-based image retrieval has been around for over a decade. While the research community has successfully exploited content features such as color and texture, finding an effective shape representation and measure remains a challenging task. The shape feature is particularly crucial for the success of content-based systems as it carries meaningful semantics of the objects of interest and fits more naturally into humans' perception of similarity. In this paper, we present our approach to use the shape feature for image retrieval. First, we introduce an effective image decomposition method called Crawling Window (CW) to distinguish the outline of each object in the image. Second, to represent each individual shape, we propose a novel representation model called component Distance Distribution Function and its measure. Traditionally, an object is represented by a set of points on the shape's contour. Our idea is to first compute the distance between each point and the center of the object. The distance values for all points form a signal, which we call Distance Distribution Function (DDF). Each DDF is then divided into component DDFs (cDDF) by taking local signal information into account. Finally, a transformation technique is employed to generate the feature vector for each cDDF. All vectors from the cDDFs in circular order construct the final shape representation. The model is invariant to position, scaling, rotation and starting point. The similarity measure model based on the new representation is also introduced. Our extensive experiments show that our models are more effective than the existing representation model, both in the shape and the image level.
Read full abstract