Fine-grained Cross-media Representation Learning with Deep Quantization Attention Network

Meiyu Liang,Zhe Xue,Yue Geng,Congxian Yang,Wu Liu,Junping Du

doi:10.1145/3343031.3350892

Abstract

Cross-media search is useful for getting more comprehensive and richer information about social network hot topics or events. To solve the problems of feature heterogeneity and semantic gap of different media data, existing deep cross-media quantization technology provides an efficient and effective solution for cross-media common semantic representation learning. However, due to the fact that social network data often exhibits semantic sparsity, diversity, and contains a lot of noise, the performance of existing cross-media search methods often degrades. To address the above issue, this paper proposes a novel fine-grained cross-media representation learning model with deep quantization attention network for social network cross-media search (CMSL). First, we construct the image-word semantic correlation graph, and perform deep random walks on the graph to realize semantic expansion and semantic embedding learning, which can discover some potential semantic correlations between images and words. Then, in order to discover more fine-grained cross-media semantic correlations, a multi-scale fine-grained cross-media semantic correlation learning method that combines global and local saliency semantic similarity is proposed. Third, the fine-grained cross-media representation, cross-media semantic correlations and binary quantization code are jointly learned by a unified deep quantization attention network, which can preserve both inter-media correlations and intra-media similarities, by minimizing both cross-media correlation loss and binary quantization loss. Experimental results demonstrate that CMSL can generate high-quality cross-media common semantic representation, which yields state-of-the-art cross-media search performance on two benchmark datasets, NUS-WIDE and MIR-Flickr 25k.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Fine-grained Cross-media Representation Learning with Deep Quantization Attention Network

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Cross-Media Semantic Correlation Learning Based on Deep Hash Network and Semantic Expansion for Social Network Cross-Media Search.
Meiyu Liang ... Haisheng Li
IEEE transactions on neural networks and learning systems | VOL. 31
Meiyu Liang, et. al.Meiyu Liang ... Haisheng Li
11 Dec 2019
IEEE transactions on neural networks and learning systems | VOL. 31

Cross‐media search method based on complementary attention and generative adversarial network for social networks
Lei Shi ... Junping Du
International Journal of Intelligent Systems | VOL. 37
Lei Shi, et. al.Lei Shi ... Junping Du
03 Nov 2021
International Journal of Intelligent Systems | VOL. 37

Remaining Useful Life Prediction Based on Deep Residual Attention Network
Biao Wang ... Yaguo Lei
-
Biao Wang, et. al.Biao Wang ... Yaguo Lei
01 Aug 2019
01 Aug 2019

Super-Resolution of GF-1 Multispectral Wide Field of View Images via a Very Deep Residual Coordinate Attention Network
Rongjie Liu ... Yi Ma
IEEE Geoscience and Remote Sensing Letters | VOL. 19
Rongjie Liu, et. al.Rongjie Liu ... Yi Ma
01 Jan 2021
IEEE Geoscience and Remote Sensing Letters | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Fine-grained Cross-media Representation Learning with Deep Quantization Attention Network

Abstract

Talk to us

Similar Papers