Abstract
Unsupervised methods have proven effective for discriminative tasks in single-modality scenarios. In this paper, we present a multimodal framework for learning sparse representations that capture semantic correlation between modalities. The framework models relationships at a higher level by enforcing a shared sparse representation. In particular, we propose a joint dictionary learning technique for sparse coding and formulate the joint representation for conciseness, cross-modal representation (in the case of a missing modality), and the union of cross-modal representations. Given the accelerated growth of multimodal data posted on the Web, such as YouTube, Wikipedia, and Twitter, learning good multimodal features is becoming increasingly important. We show that the shared representations enabled by our framework substantially improve classification performance in both unimodal and multimodal settings. We further show that deep architectures built on the proposed framework are effective when the correlations between modalities are highly nonlinear. The effectiveness of our approach is demonstrated experimentally on image denoising, multimedia event detection and retrieval on the TRECVID dataset (audio-video), category classification on the Wikipedia dataset (image-text), and sentiment classification on PhotoTweet (image-text).
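To make the formulation concrete, the following is a minimal sketch of the idea in Python, using scikit-learn's DictionaryLearning and Lasso as generic stand-ins for the joint sparse coding described above; the random toy data, feature dimensions, dictionary size, and regularization weights are illustrative assumptions, not the setup used in the paper.

```python
import numpy as np
from sklearn.decomposition import DictionaryLearning
from sklearn.linear_model import Lasso

# Toy paired data standing in for two modalities (e.g., image and text features).
rng = np.random.RandomState(0)
n_samples, d_img, d_txt = 200, 64, 32
X_img = rng.randn(n_samples, d_img)
X_txt = rng.randn(n_samples, d_txt)

# Joint dictionary learning: concatenate the modalities per sample so a single
# sparse code (the shared representation) must reconstruct both at once.
X_joint = np.hstack([X_img, X_txt])
dl = DictionaryLearning(n_components=48, alpha=1.0, max_iter=20, random_state=0)
Z_shared = dl.fit_transform(X_joint)      # shared sparse codes, one row per sample
D = dl.components_                        # joint dictionary, shape (48, d_img + d_txt)

# Split the joint dictionary into its per-modality blocks.
D_img, D_txt = D[:, :d_img], D[:, d_img:]

# Cross-modal representation: if the text modality is missing at test time,
# infer the sparse code from the image block alone (an l1-regularized fit),
# then use the text block of the dictionary to reconstruct the text features.
lasso = Lasso(alpha=0.1, fit_intercept=False, max_iter=5000)
lasso.fit(D_img.T, X_img[0])              # solve x_img ~ D_img^T z with a sparse z
z_cross = lasso.coef_
x_txt_hat = z_cross @ D_txt               # reconstructed text features for sample 0
```

In the same spirit, the union of cross-modal representations mentioned above can be sketched by inferring a sparse code from each modality's dictionary block separately and concatenating the resulting codes when both modalities are present.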