Learning Cross-Media Joint Representation With Sparse and Semisupervised Regularization

Xiaohua Zhai, Jianguo Xiao, Yuxin Peng

doi:10.1109/tcsvt.2013.2276704

Abstract

Cross-media retrieval has become a key problem in both research and application: users submit a query of any media type and search for results across all media types (text, image, audio, video, and 3-D). The key challenge is how to measure content similarity among different media. Existing cross-media retrieval methods usually model either the pairwise correlation or the semantic information, but not both together. These two kinds of information are in fact complementary, and optimizing them simultaneously can further improve accuracy. In this paper, we propose a novel feature learning algorithm for cross-media data, called joint representation learning (JRL), which jointly explores the correlation and semantic information in a unified optimization framework. JRL integrates sparse and semisupervised regularization for different media types into one unified optimization problem, whereas existing feature learning methods generally focus on a single media type. On one hand, JRL learns sparse projection matrices for different media simultaneously, so that different media can be aligned with each other in a way that is robust to noise. On the other hand, both the labeled and unlabeled data of different media types are exploited. Unlabeled examples of different media types increase the diversity of the training data and boost the performance of joint representation learning. Furthermore, JRL not only reduces the dimension of the original features but also incorporates the cross-media correlation into the final representation, which further improves the performance of both cross-media retrieval and single-media retrieval. Experiments on two datasets with up to five media types show the effectiveness of the proposed approach compared with state-of-the-art methods.
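The retrieval setting described in the abstract, where each media type is projected into a shared space and a query of one type ranks items of another, can be illustrated with a small sketch. This is not the JRL objective: it substitutes plain ridge regression onto one-hot class labels for the projection learning and omits the sparse and semisupervised regularization terms entirely. The function names (`fit_projection`, `rank_by_cosine`, `make_features`), the synthetic data, and all parameter values are assumptions made for illustration only.

```python
import numpy as np

def fit_projection(X, Y, lam=1e-2):
    """Ridge regression stand-in: P = argmin ||X P - Y||^2 + lam ||P||^2.

    Maps features of one media type into the shared label space.
    (Assumption: JRL's sparse and semisupervised terms are NOT modeled here.)
    """
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ Y)

def rank_by_cosine(query, gallery):
    """Return gallery row indices ordered from most to least similar."""
    q = query / (np.linalg.norm(query) + 1e-12)
    g = gallery / (np.linalg.norm(gallery, axis=1, keepdims=True) + 1e-12)
    return np.argsort(-(g @ q))

def make_features(labels, dim, rng, noise=0.3):
    """Hypothetical synthetic features: one well-separated center per class."""
    n_classes = labels.max() + 1
    centers = np.zeros((n_classes, dim))
    centers[np.arange(n_classes), np.arange(n_classes)] = 5.0
    return centers[labels] + noise * rng.normal(size=(labels.size, dim))

rng = np.random.default_rng(0)
labels = np.repeat(np.arange(3), 20)      # 3 classes, 20 paired items each
Y = np.eye(3)[labels]                     # one-hot semantic labels

X_img = make_features(labels, 6, rng)     # "image" features, 6-D
X_txt = make_features(labels, 4, rng)     # "text" features, 4-D

P_img = fit_projection(X_img, Y)          # one projection per media type
P_txt = fit_projection(X_txt, Y)

# Text-to-image retrieval: project both sides, then rank by cosine similarity.
query_idx = 45
ranking = rank_by_cosine(X_txt[query_idx] @ P_txt, X_img @ P_img)
top_labels = labels[ranking[:5]]
print("query label:", labels[query_idx], "top-5 retrieved labels:", top_labels)
```

Because both media types end up in the same low-dimensional space, the same ranking function serves text-to-image, image-to-text, and single-media queries; only the projection applied to each side changes.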

Journal: IEEE Transactions on Circuits and Systems for Video Technology
Publication Date: Jun 1, 2014
Citations: 269

Similar Papers

Cross-Media Feature Learning Framework with Semi-supervised Graph Regularization
Tingting Qi ... Hong Zhang
01 Jan 2018

Cross-media retrieval by intra-media and inter-media correlation mining
Xiaohua Zhai ... Yuxin Peng
Multimedia Systems | VOL. 19
19 Dec 2012

Semi-Supervised Cross-Media Feature Learning With Unified Patch Graph Regularization
Yuxin Peng ... Xin Huang
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 26
01 Mar 2016

Cross-media retrieval based on semi-supervised regularization and correlation learning
Hong Zhang ... Du Tang
Multimedia Tools and Applications | VOL. 77
05 May 2018
