Abstract

In this paper, we propose a novel framework for instance-level image retrieval. Recent methods fine-tune a Convolutional Neural Network (CNN) via a Siamese architecture to improve off-the-shelf CNN features. They generally train such networks with a ranking loss and do not make full use of supervised information, especially with more complex neural architectures. To address this, we propose a two-stage triplet-network training framework. First, we propose a Double-Loss Regularized Triplet Network (DLRTN), which extends the basic triplet network by attaching a classification sub-network and is trained by simultaneously optimizing two different types of loss functions. The double loss of DLRTN targets the specific retrieval task and jointly boosts the discriminative capability of DLRTN from different aspects via supervised learning. Second, taking the feature maps of the last convolutional layer extracted from DLRTN and the regions detected by a region proposal network as input, we introduce a Regional Generalized-Mean Pooling (RGMP) layer into the triplet network and re-train the network to learn the pooling parameters. Through RGMP, we pool the feature maps of each region and aggregate the features of different regions of each image into Regional Generalized Activations of Convolutions (R-GAC) as the final image representation. R-GAC generalizes the existing Regional Maximum Activations of Convolutions (R-MAC) and is thus more robust to scale and translation. We conduct experiments on six image retrieval datasets, including standard benchmarks and the recently introduced INSTRE dataset. Extensive experimental results demonstrate the effectiveness of the proposed framework.
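To make the pooling step concrete, the sketch below illustrates generalized-mean (GeM) pooling applied per region and a sum-plus-L2 aggregation in the spirit of R-MAC/R-GAC. This is not the authors' implementation: the function names, the fixed exponent p, the hand-specified region boxes, and the aggregation details are assumptions based on standard GeM and R-MAC practice (in the paper, p is learned by re-training the triplet network and regions come from a region proposal network).

```python
import numpy as np

def gem_pool(region_feats, p=3.0, eps=1e-6):
    """Generalized-mean (GeM) pooling over the spatial dimensions of one region.

    region_feats: array of shape (C, H, W). p = 1 recovers average pooling,
    and large p approaches max pooling (the R-MAC case).
    """
    clipped = np.clip(region_feats, eps, None)  # keep activations positive
    return np.power(np.mean(np.power(clipped, p), axis=(1, 2)), 1.0 / p)

def regional_gac(feature_map, regions, p=3.0):
    """Aggregate per-region GeM descriptors into a single image vector (R-GAC-style).

    feature_map: (C, H, W) activations of the last convolutional layer.
    regions: list of (y0, y1, x0, x1) boxes on the feature-map grid,
             e.g. produced by a region proposal network (assumed given here).
    """
    descriptors = []
    for (y0, y1, x0, x1) in regions:
        vec = gem_pool(feature_map[:, y0:y1, x0:x1], p=p)
        vec /= (np.linalg.norm(vec) + 1e-12)    # l2-normalize each region vector
        descriptors.append(vec)
    image_vec = np.sum(descriptors, axis=0)     # sum-aggregate the regions
    return image_vec / (np.linalg.norm(image_vec) + 1e-12)

# Example: a 512-channel 30x40 feature map with two hypothetical regions.
fmap = np.random.rand(512, 30, 40).astype(np.float32)
regions = [(0, 20, 0, 20), (10, 30, 20, 40)]
print(regional_gac(fmap, regions).shape)        # (512,)
```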
