Abstract

In the modern era of Internet, mobile and digital information technology, image retrieval for object identification, just as wine label retrieval from a wine bottle image, has become an important and urgent problem in artificial intelligence. In comparison with the general image retrieval, it is rather challenging because there are a huge number of object identification or brand images which are very similar and difficult to discriminate, and the number of different brand images in the given dataset changes greatly, that is, the samples are strongly unbalanced for these brands. In this paper, we propose a CNN-SURF Consecutive Filtering and Matching (CSCFM) framework for this kind of image retrieval, specifically focalizing on wine label retrieval. In particular, Convolutional Neural Network (CNN) is utilized to filter out the impossible main-brands (manufacturers) for narrowing down the range of retrieval and the Speeded Up Robust Features (SURF) matching is improved by adopting the RANdom SAmple Consensus (RANSAC) mechanism and the modified Term Frequency–Inverse Document Frequency (TF-IDF) distance for the accurate retrieval of the sub-brand (item attribute under the manufacture). The experiments are conducted on a dataset containing approximately 548k images of wine labels with 17, 328 main-brands and 260, 579 sub-brands. It is demonstrated by the experimental results that our proposed method can solve the wine label retrieval problem effectively and efficiently. Moreover, our proposed method is further evaluated on two pubic benchmarks of the object identification image retrieval tasks, Oxford Buildings Benchmark (Oxford5k) and the University of Kentucky of Indoor Things Benchmark (UKB), and achieves 88.3% mean average precision and 3.92 N-S score in Oxford5k and UKB, respectively.

Highlights

  • Image retrieval has been one of classical research fields in computer vision and image processing

  • The result of the conventional Speeded Up Robust Features (SURF) matching is that A is the best matching to x0. our improved SURF matching considers the difference in the importance of each SURF descriptor, for example, s05 is a unique feature between x0 and D, it is more important than the other SURF descriptors in the matching process, and our decision is that D is the best matching to x0

  • Consecutive Filtering and Matching (CSCFM) retrieval and comparison on a general wine label image dataset to demonstrate the effectiveness of the proposed framework

Read more

Summary

INTRODUCTION

Image retrieval has been one of classical research fields in computer vision and image processing. The SURF descriptor, which is referred to as the CNN-SURF Consecutive Filtering and Matching (CSCFM) framework It is effective for the large scale image retrieval with a dataset which has a category structure, i.e., whose samples can be naturally classified into a number of categories (corresponding to main-brands). This is a kind of categorystructured image retrieval. We conduct extensive experiments on a dataset containing approximately 548k wine images with 17, 328 main-brands and 260, 579 sub-brands It is demonstrated by the experimental results that our proposed CSCFM method can solve the wine label retrieval problem effectively and efficiently.

RELATED WORK
IMPROVED SURF MATCHING FOR THE FINAL SUB-BRANDS
5: Calculate the modified TF-IDF distance Di by
EXPERIMENTAL RESULTS
CSCFM RETRIEVAL AND COMPARISON ON A GENERAL WINE LABEL IMAGE DATASET
CONCLUSION
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call