Deep Multimodal Complementarity Learning.

Daheng Wang,Tong Zhao,Wenhao Yu,Nitesh V Chawla,Meng Jiang

doi:10.1109/tnnls.2022.3165180

Abstract

Complementarity plays a significant role in the synergistic effect created by different components of a complex data object. Complementarity learning on multimodal data has fundamental challenges of representation learning because the complementarity exists along with multiple modalities and one or multiple items of each modality. Also, an appropriate metric is needed for measuring the complementarity in the representation space. Existing methods that rely on similarity-based metrics cannot adequately capture the complementarity. In this work, we propose a novel deep architecture for systematically learning the complementarity of components from multimodal multi-item data. The proposed model consists of three major modules: 1) unimodal aggregation for extracting the intramodal complementarity; 2) cross-modal fusion for extracting the intermodal complementarity at the modality level; and 3) interactive aggregation for extracting the intermodal complementarity at the item level. To quantify complementarity, we utilize the TUBE distance metric to measure the difference between the composited data object and its label in the representation space. Experiments on three real datasets show that our model outperforms the state-of-the-art by +6.8% of mean reciprocal rank (MRR) on object classification and +3.0% of MRR on hold-out item prediction. Qualitative analyses reveal that complementarity is significantly different from similarity.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep Multimodal Complementarity Learning.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Neural Networks and Learning Systems

Lead the way for us

Journal: IEEE Transactions on Neural Networks and Learning Systems	Publication Date: Dec 1, 2023
Citations: 5

Similar Papers

Self-Supervised Multimodal Learning: A Survey.
Yongshuo Zong ... Timothy Hospedales
IEEE transactions on pattern analysis and machine intelligence | VOL. PP
Yongshuo Zong, et. al.Yongshuo Zong ... Timothy Hospedales
01 Jan 2024
IEEE transactions on pattern analysis and machine intelligence | VOL. PP

Effective Sentiment Analysis for Multimodal Review Data on the Web
Peiquan Jin ... Lin Mu
-
Peiquan Jin, et. al.Peiquan Jin ... Lin Mu
01 Jan 2020
01 Jan 2020

Deep Multi-modal Latent Representation Learning for Automated Dementia Diagnosis
Tao Zhou ... Ling Shao
-
Tao Zhou, et. al.Tao Zhou ... Ling Shao
01 Jan 2019
01 Jan 2019

Task Recommendation Method Combining Multimodal Cognition and Collaboration in Mobile Crowdsensing Systems
Jian Wang ... Guosheng Zhao
Computer Networks | VOL. 229
Jian Wang, et. al.Jian Wang ... Guosheng Zhao
22 Apr 2023
Computer Networks | VOL. 229

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep Multimodal Complementarity Learning.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Neural Networks and Learning Systems