Abstract

Cross-modal hashing is a key technology for real-time retrieval of large-scale multimedia data in real-world applications. Although existing cross-modal hashing methods have achieved impressive results, some limitations remain: (1) some methods do not fully consider the rich semantic information and the noise in labels, resulting in a large semantic gap, and (2) some methods adopt relaxation-based or discrete cyclic coordinate descent algorithms to handle the discrete constraints, resulting in large quantization error or high time consumption. To address these limitations, in this paper we propose a novel method, named Discrete Semantics-Guided Asymmetric Hashing (DSAH). Specifically, DSAH leverages both label information and the similarity matrix to enhance the semantic information of the learned hash codes, and uses the ℓ2,1 norm to increase matrix sparsity, mitigating the inevitable noise and subjective factors in labels. Meanwhile, an asymmetric hash learning scheme is proposed to perform hash learning efficiently. In addition, a discrete optimization algorithm is proposed to solve for the hash codes directly, discretely, and quickly. During optimization, hash code learning and hash function learning interact: the learned hash codes guide the learning of the hash function, and the hash function simultaneously guides hash code generation. Extensive experiments on two benchmark datasets highlight the superiority of DSAH over several state-of-the-art methods.
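The abstract's use of the ℓ2,1 norm to induce sparsity can be illustrated with a minimal sketch. The function name `l21_norm` and the example matrix are illustrative assumptions, not the paper's code; the paper's actual objective applies this norm inside a larger optimization.

```python
import numpy as np

def l21_norm(M):
    """ℓ2,1 norm: the sum of the ℓ2 norms of the rows of M.
    Minimizing it as a regularizer drives whole rows toward zero,
    which yields row-wise sparsity and robustness to noisy labels."""
    return float(np.sum(np.linalg.norm(M, axis=1)))

# Illustrative matrix: row norms are 5, 0, and 1, so the ℓ2,1 norm is 6.
M = np.array([[3.0, 4.0],
              [0.0, 0.0],
              [1.0, 0.0]])
print(l21_norm(M))  # 6.0
```

Unlike the entrywise ℓ1 norm, the ℓ2,1 norm couples the entries within each row, so a row is either suppressed as a whole (e.g., a noisy label vector) or kept largely intact.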

Highlights

  • In recent years, due to the rapid development of multimedia Internet of Things technologies, there has been an explosive growth in the amount of multimedia network data. The current unimodal search methods can no longer meet the multimedia data retrieval requirements in the complex environment of the new information era. Cross-modal retrieval methods [1,2,3] have therefore received increasing attention from the information retrieval community and have become a hot research topic in both academia and industry

  • Our proposed Discrete Semantics-Guided Asymmetric Hashing (DSAH) leverages both the similarity matrix and label information to enhance the semantic information of the learned hash codes, and addresses the problem of noise contained in the labels

  • On the MIRFlickr dataset, compared to the best baseline, i.e., Subspace Relation Learning for Cross-modal Hashing (SRLCH), the mean average precision (mAP) scores of DSAH increase by 2.7% on average, and on the NUS-WIDE dataset, DSAH obtains the highest mAP scores among all compared baselines, which demonstrates the efficacy of DSAH
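
The mAP metric cited above can be computed as follows; this is a minimal sketch of the standard definition, not the paper's evaluation code, and the function name and sample relevance lists are illustrative assumptions.

```python
import numpy as np

def mean_average_precision(relevance_lists):
    """relevance_lists: one binary relevance vector per query, ordered by
    decreasing retrieval score (e.g., increasing Hamming distance).
    Returns the mean of the per-query average precision (AP) values."""
    aps = []
    for rel in relevance_lists:
        rel = np.asarray(rel, dtype=float)
        if rel.sum() == 0:          # query with no relevant items
            aps.append(0.0)
            continue
        cum_hits = np.cumsum(rel)   # relevant items retrieved up to rank k
        precision_at_k = cum_hits / (np.arange(len(rel)) + 1)
        aps.append(float(np.sum(precision_at_k * rel) / rel.sum()))
    return float(np.mean(aps))

# One query, relevant items at ranks 1 and 3: AP = (1/1 + 2/3) / 2 = 5/6.
print(mean_average_precision([[1, 0, 1, 0]]))
```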


Summary

Introduction

Due to the rapid development of multimedia Internet of Things technologies, the amount of multimedia network data has grown explosively. Some cross-modal hashing methods are based on symmetric learning strategies, which yield worse retrieval performance than asymmetric ones. Our proposed DSAH handles the nonlinear relations within different modalities with a kernelization technique, and an asymmetric learning scheme is proposed to effectively perform the hash function learning and hash code learning processes. We leverage both label information and the similarity matrix to enhance the semantic information of the learned hash codes. In summary, a novel supervised cross-modal hashing method, DSAH, is proposed to learn discriminative compact hash codes for large-scale retrieval tasks. DSAH takes the label information and similarity matrix into consideration, which improves the discriminative capability of the learned hash codes, and addresses the problems of matrix sparseness and outlier handling.
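The kernelization step mentioned above, which captures nonlinear relations before hash learning, is commonly realized with an RBF anchor-based feature map. The sketch below assumes that approach; the function name, anchor count, and bandwidth are illustrative choices, not the paper's exact configuration.

```python
import numpy as np

def rbf_kernel_features(X, anchors, sigma=1.0):
    """Map raw features X (n x d) to nonlinear kernel features (n x m)
    against m anchor points: phi(x)_j = exp(-||x - a_j||^2 / (2 sigma^2))."""
    sq_dists = ((X[:, None, :] - anchors[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-sq_dists / (2 * sigma ** 2))

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 8))                          # 100 samples, 8-dim features
anchors = X[rng.choice(100, size=16, replace=False)]   # 16 anchors sampled from X
Phi = rbf_kernel_features(X, anchors)
print(Phi.shape)  # (100, 16)
```

Hash functions are then learned on `Phi` rather than on the raw features, so linear projections in the kernel space can model nonlinear structure in the original space.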

Related Works
Unsupervised Hashing
Supervised Hashing
Deep Hashing
The Proposed DSAH Framework
Kernelization
Feature Mapping
Label Alignment Scheme
Asymmetric Learning Framework
The Joint Framework
Optimization
Out-of-Sample Extension
Complexity Analysis
MIRFlickr
NUS-WIDE
Methodology
Implementation Details
Results
Method
Effects of Discrete Optimization
Effects of Kernelization
Effects of Word Embeddings
Effects of Deep Learning Based Representation
Effects of Parameters
Convergence Analysis
Limitations
Conclusions
