Abstract

With the advent of the era of big data, information has gradually changed from a single modal to a diversified form, such as image, text, video, audio, etc. With the growth of multimedia data, the key problem faced by cross-media retrieval technology is how to quickly retrieve multimedia data with different modalities of the same semantic. At present, many cross-media retrieval techniques use local annotated samples for training. In this way, the semantic information of the data cannot be fully utilized, and manual annotation is required, which is rather labor-intensive prone to errors and subjective viewing. Therefore, this paper proposes a Semi-Supervised learning based Semantic Cross-Media Retrieval (S3CMR) method aiming at the above problems. The main advantage of this method is to make full use of the relationship between the semantic information of the labeled samples and the unlabeled samples. Simultaneously, we integrate the linear regression term, correlation analysis term, and feature selection term into a joint cross-media learning framework. These terms interact with each other and embed more semantics in the shared subspace. Furthermore, an iterative method guaranteed with convergence is proposed to solve the formulated optimization problem. Experimental results on three publicly available datasets demonstrate that the proposed method outperforms eight state-of-the-art cross-media retrieval methods.

Highlights

  • In the age of big data, the amount of Internet data has increased dramatically

  • Given the limitations of the various methods mentioned above, this paper proposes a Semi-Supervised learning based Semantic Cross-Media Retrieval (S3CMR) method, which considers the potential semantic information of labeled samples and unlabeled samples to further improve the performance of cross-media retrieval

  • Experimental results on three publicly available datasets demonstrate that the proposed method outperforms eight state-of-the-art cross-media retrieval methods

Read more

Summary

INTRODUCTION

In the age of big data, the amount of Internet data has increased dramatically. The internet has changed the way people communicate with life and accelerated the fusion of multimedia data due to the popularity of smartphones. To solve the above mentioned problems, the current crossmedia retrieval idea is mainly to project different modal data into a common subspace and measure the similarity between different modal data by distance calculation method Based on this idea, the existing cross-media retrieval methods are mainly divided into: unsupervised methods and supervised methods. Given the limitations of the various methods mentioned above, this paper proposes a Semi-Supervised learning based Semantic Cross-Media Retrieval (S3CMR) method, which considers the potential semantic information of labeled samples and unlabeled samples to further improve the performance of cross-media retrieval. A Semi-supervised learning based semantic cross-media retrieval method is proposed, using all the images and text information in the training set to learn the projection matrices.

RELATED WORK
SEMI-SUPERVISED LEARNING BASED SEMANTIC
OBJECTIVE FUNCTION
ITERATIVE OPTIMIZATION
EXPERIMENT
Findings
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call