Abstract
Deep multimodal hashing has received increasing research attention in recent years owing to its superior performance in large-scale multimedia retrieval. However, limited efforts have been made to explore the complex multilevel semantic structure for deep multimodal hashing. In this paper, we propose a novel deep multimodal hashing method, termed Deep Hashing with Multilevel Similarity Learning (DHMSL), which learns compact and discriminative hash codes by exploring multilevel semantic similarity correlations of multimedia data. In DHMSL, multilevel similarity correlations are exploited to learn unified binary hash codes from the local structure and the semantic label information simultaneously. Meanwhile, bit-balance and quantization constraints are imposed to make the unified hash codes more compact. With the unified binary codes learned, two deep neural networks are jointly trained to simultaneously learn feature representations and two sets of nonlinear hash functions. Specifically, well-designed loss functions minimize both the prediction errors of the feature representations and the errors between the unified binary codes and the outputs of the networks. Extensive experiments on two widely used multimodal datasets demonstrate that the proposed method achieves state-of-the-art performance on both image-query-text and text-query-image tasks.
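The bit-balance and quantization constraints mentioned above have standard formulations in the hashing literature; the sketch below illustrates them in their common form (a quantization term pulling real-valued network outputs toward binary codes, and a balance term pushing each bit to be +1 for roughly half the samples). The function name and shapes are hypothetical, not the paper's exact objective.

```python
import numpy as np

def hashing_constraints(h, b):
    """Illustrative quantization and bit-balance penalties (hypothetical helper).

    h : (n, k) real-valued network outputs
    b : (n, k) unified binary codes in {-1, +1}
    """
    # Quantization loss: push the real-valued outputs toward the binary codes.
    quant = np.sum((b - h) ** 2)
    # Bit-balance loss: each bit should be +1 for about half the samples,
    # so every column sum of h should be close to zero.
    balance = np.sum(h.sum(axis=0) ** 2)
    return quant, balance

# Toy example: binary, perfectly balanced outputs incur zero penalty.
h = np.array([[1.0, -1.0],
              [-1.0, 1.0]])
b = np.sign(h)
quant, balance = hashing_constraints(h, b)
```

Driving both terms to zero simultaneously is what makes the learned codes compact: outputs are nearly binary, and each bit carries close to one bit of information.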
Journal of Visual Communication and Image Representation