Abstract

Cross-modal (e.g., text-to-image or image-to-text) retrieval has received great attention with the flood of multi-modal social media data. Bridging the heterogeneity gap between modalities remains a considerable challenge. Existing methods project different modalities into a common space by minimizing the distance within heterogeneous pairs (intra-pair) in the new latent space. However, the relationships among these multi-modal pairs (inter-pair) are neglected, even though they are beneficial for eliminating the heterogeneity. In this paper, we propose a novel algorithm for cross-modal retrieval based on canonical correlation analysis that considers the high-order relationships among pairs (HCCA). Both the supervised setting (with additional semantic labels) and the unsupervised setting (without semantic labels) are considered simultaneously by treating the intra- and inter-pair correlations discriminatively. Moreover, the kernel trick is applied to HCCA to learn a non-linear projection, termed HKCCA. Extensive experiments conducted on three public datasets demonstrate the superiority of the proposed methods over state-of-the-art approaches to cross-modal retrieval.
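To make the setting concrete, the sketch below shows the classical CCA baseline that HCCA extends: paired text and image features are projected into a common latent space by maximizing intra-pair correlation, and retrieval is done by ranking in that space. This is not the authors' HCCA/HKCCA method (the inter-pair, high-order, and kernelized terms are omitted), and the feature dimensions and data here are illustrative assumptions.

```python
# Minimal CCA baseline for cross-modal retrieval (NOT the paper's HCCA/HKCCA).
# Dimensions and random features below are assumptions for illustration only.
import numpy as np
from sklearn.cross_decomposition import CCA

rng = np.random.default_rng(0)
n_pairs, d_text, d_image, d_latent = 500, 300, 128, 10

# Paired features: row i of X_text and row i of X_image describe the same item.
X_text = rng.standard_normal((n_pairs, d_text))
X_image = rng.standard_normal((n_pairs, d_image))

# Learn linear projections that maximize correlation within each pair
# (the intra-pair objective that existing methods optimize).
cca = CCA(n_components=d_latent)
cca.fit(X_text, X_image)
Z_text, Z_image = cca.transform(X_text, X_image)

def l2_normalize(Z):
    # Normalize rows so that dot products become cosine similarities.
    return Z / np.linalg.norm(Z, axis=1, keepdims=True)

# Text-to-image retrieval: rank images by cosine similarity in the shared space.
sims = l2_normalize(Z_text) @ l2_normalize(Z_image).T  # shape (n_pairs, n_pairs)
top1 = sims.argmax(axis=1)  # best-matching image index for each text query
```

HCCA additionally models inter-pair (high-order) relationships among the multi-modal pairs, and HKCCA replaces the linear projections with kernel-induced non-linear ones.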
