Feature-level Fusion vs. Score-level Fusion for Image Retrieval Based on Pre-trained Deep Neural Networks

Nikolay Neshov,Krasmir Tonchev,Vladimir Poulkov,Agata Manolova,Georgi Balabanov

doi:10.13052/jmm1550-4646.2041

Abstract

Today’s complex multimedia content made retrieving images similar to the user’s query from the database a challenging task. The performance of a Content-Based Image Retrieval System (CBIR) system highly depends on the image representation in a form of low-level features and similarity measurement. The traditional visual descriptors that do not provide good prior domain knowledge could lead to poor performance retrieval results. On the other hand, Deep Convolutional Neural Networks (DCNNs) have recently achieved a remarkable success as methods for image classification in various domains. Recently, pre-trained deep convolution neural networks on thousands of classes have the ability to extract very accurate and representative features which, in addition to classification, can also be successfully used in image retrieval systems. ResNet152, GoogLeNet and InceptionV3 are some of the effective and successful examples of pre-trained DCNNs recently applied in a computer vision tasks such as object recognition, clustering, and classification. In this paper, two approaches for a CBIR system, namely early fusion and late fusion, have been presented and compared. The early fusion utilizes concatenation of the features extracted by each possible pair of DCNNs, that is ResNet152-GoogLeNet, ResNet152-InceptionV3, and GoogLeNet-InceptionV3, and the late fusion apply CombSum method with Z-Score standardization to combine the score results provided by each DCNN of the aforementioned pairs. In the experiments on a popular WANG dataset it has been shown that late fusion approach slightly outperforms early fusion approach. The best performance of our experiments in terms of Average Precision (AP) for the top 20 results reaches 96.82%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Feature-level Fusion vs. Score-level Fusion for Image Retrieval Based on Pre-trained Deep Neural Networks

Abstract

Talk to us

Similar Papers

More From: Journal of Mobile Multimedia

Lead the way for us

Similar Papers

Automated detection of leukemia by pretrained deep neural networks and transfer learning: A comparison
K.K Anilkumar ... T.M Sagi
Medical Engineering & Physics | VOL. 98
K.K Anilkumar, et. al.K.K Anilkumar ... T.M Sagi
13 Oct 2021
Medical Engineering & Physics | VOL. 98

Apple Fruit Classification and Damage Detection Using Pre-trained Deep Neural Network as Feature Extractor
Gurucharan Kapila ... C H Ajay Kumar
-
Gurucharan Kapila, et. al.Gurucharan Kapila ... C H Ajay Kumar
01 Jan 2021
01 Jan 2021

Rotation Invariant 2D Ear Recognition Using Gabor Filters and Ensemble of Pre-trained Deep Convolutional Neural Network Model
Ravishankar Mehta ... Koushlendra K Singh
-
Ravishankar Mehta, et. al.Ravishankar Mehta ... Koushlendra K Singh
29 Mar 2023
29 Mar 2023

Single-cell conventional pap smear image classification using pre-trained deep neural network architectures
Mohammed Aliy Mohammed ... Yodit Abebe Ayalew
BMC Biomedical Engineering | VOL. 3
Mohammed Aliy Mohammed, et. al.Mohammed Aliy Mohammed ... Yodit Abebe Ayalew
29 Jun 2021
BMC Biomedical Engineering | VOL. 3

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Feature-level Fusion vs. Score-level Fusion for Image Retrieval Based on Pre-trained Deep Neural Networks

Abstract

Talk to us

Similar Papers

More From: Journal of Mobile Multimedia