Abstract

With the rapid growth of multimedia data such as text, images, video, audio, and 3-D models, cross-media retrieval has become increasingly important: users can retrieve results of various media types by submitting a query of any media type. Compared with single-media retrieval, such as image retrieval or text retrieval, cross-media retrieval is more flexible because it provides retrieval results across all media types at the same time. In this paper, we focus on how to learn cross-media features for different media types, which is a key challenge for cross-media retrieval. Existing methods either model each media type separately or exploit only labeled multimedia data. In fact, data of different media types that share the same semantic category are complementary to each other, and modeling them jointly can improve the accuracy of cross-media retrieval. In addition, although labeled data are accurate, they require considerable human labor and are therefore scarce. To address these problems, we propose a semi-supervised cross-media feature learning algorithm with unified patch graph regularization (S²UPG). Our motivation and contributions lie mainly in three aspects. First, existing methods model different media types in separate graphs, whereas we employ one joint graph to model all media types simultaneously. The joint graph fully exploits the semantic correlations among the various media types, which complement one another and provide rich hints for cross-media correlation. Second, existing methods consider only the original media instances (such as images, videos, texts, audio clips, and 3-D models) and ignore their patches, whereas we make full use of both the media instances and their patches in one graph. Cross-media patches emphasize the important parts of each instance and make cross-media correlations more precise. Third, traditional semi-supervised learning methods exploit only single-media unlabeled instances, whereas our approach fully exploits cross-media unlabeled instances and their patches, which increases the diversity of the training data and boosts the accuracy of cross-media retrieval. Comprehensive experiments on three datasets, including the challenging XMedia dataset with five media types, show that our approach outperforms the current state-of-the-art methods.
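
To make the graph-regularization idea concrete, below is a minimal, self-contained sketch of semi-supervised learning over one joint graph that mixes labeled and unlabeled nodes from multiple media types. It is not the paper's exact S²UPG objective: the k-nearest-neighbor affinity, the label-propagation objective, the closed-form solver, and all names and dimensions (knn_affinity, n_img, n_txt, mu, and so on) are illustrative assumptions intended only to show the general mechanism of Laplacian regularization on a unified graph, in which patches would appear as additional graph nodes.

```python
# Minimal sketch (NOT the paper's S²UPG formulation): semi-supervised
# label propagation with graph-Laplacian regularization over one joint
# graph whose nodes come from multiple media types. All names, dimensions,
# and the toy data below are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)

# Toy setup: features from two media types (say, image and text) stacked
# into one matrix X, so all nodes live in a single joint graph. In the
# paper's setting, media instances AND their patches would all be rows here.
n_img, n_txt, d = 40, 40, 16
X = rng.standard_normal((n_img + n_txt, d))
y = np.full(n_img + n_txt, -1)                # -1 marks unlabeled nodes
y[:10] = rng.integers(0, 3, 10)               # a few labeled "image" nodes
y[n_img:n_img + 10] = rng.integers(0, 3, 10)  # a few labeled "text" nodes

def knn_affinity(X, k=5, sigma=1.0):
    """Joint affinity matrix: Gaussian weights on a kNN graph that
    connects nodes across ALL media types at once."""
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
    W = np.exp(-d2 / (2 * sigma ** 2))
    np.fill_diagonal(W, 0.0)
    thresh = np.sort(W, axis=1)[:, -k][:, None]  # k-th largest per row
    W = np.where(W >= thresh, W, 0.0)            # keep k strongest edges
    return np.maximum(W, W.T)                    # symmetrize

W = knn_affinity(X)
D = np.diag(W.sum(axis=1))
L = D - W                                        # unnormalized graph Laplacian

# Semi-supervised objective (standard Laplacian regularization, assumed here):
#   min_F  tr((F - Y)^T U (F - Y)) + mu * tr(F^T L F)
# where U selects labeled rows; setting the gradient to zero gives
#   (U + mu * L) F = U Y.
c = 3                                            # number of semantic categories
labeled = y >= 0
Y = np.zeros((len(y), c))
Y[labeled, y[labeled]] = 1.0
U = np.diag(labeled.astype(float))
mu = 1.0
A = U + mu * L + 1e-6 * np.eye(len(y))           # small ridge for invertibility
F = np.linalg.solve(A, U @ Y)                    # closed-form minimizer

pred = F.argmax(axis=1)                          # scores for every node,
print(pred[:5], pred[n_img:n_img + 5])           # including unlabeled ones
```

The Laplacian term tr(F^T L F) penalizes label disagreement between graph neighbors; this is how unlabeled cross-media nodes (and, in the paper's formulation, patches) influence the solution even though only a few nodes carry labels.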
