Abstract

Data imperfection is an inevitable phenomenon in real-world applications and has emerged as one of the most critical challenges for multimodal sentiment analysis. However, existing approaches tend to focus narrowly on a specific type of imperfection, leading to performance degradation in real-world scenarios where multiple types of noise exist simultaneously. In this work, we formulate the imperfection as modality feature missing during training and propose a noise imitation based adversarial training framework to improve robustness against various potential imperfections at inference time. Specifically, the proposed method first uses temporal feature erasing as an augmentation to construct noisy instances and exploits modality interactions through the self-attention mechanism to learn multimodal representations for original-noisy instance pairs. Then, based on the paired intermediate representations, a novel adversarial training strategy with semantic reconstruction supervision is proposed to learn a unified joint representation for noisy and perfect data. In experiments, the proposed method is first verified under modality feature missing, the same type of imperfection as in training, and shows impressive performance. Moreover, we show that our approach achieves outstanding results for other types of imperfection, including modality missing, automatic speech recognition errors, and attacks on text, highlighting the generalizability of our model. Finally, we conduct case studies on general additive perturbations, which introduce background noise and blur into raw video clips, further revealing the capability of our proposed method for real-world applications.
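As an illustration of the temporal feature erasing augmentation mentioned above, the following is a minimal sketch, not the authors' implementation: it assumes one modality is represented as a (time steps, feature dimension) tensor, and the function name temporal_feature_erase, the erase_ratio parameter, and the tensor sizes are all hypothetical choices made for this example.

```python
import torch

def temporal_feature_erase(features: torch.Tensor, erase_ratio: float = 0.3) -> torch.Tensor:
    """Zero out a random fraction of time steps in a modality feature sequence.

    Hypothetical illustration of noisy-instance construction; the paper's exact
    erasing scheme (ratio, sampling strategy, per-modality handling) may differ.

    Args:
        features: tensor of shape (seq_len, feat_dim) for one modality.
        erase_ratio: fraction of time steps whose features are erased.
    """
    seq_len = features.size(0)
    num_erase = int(seq_len * erase_ratio)
    noisy = features.clone()
    if num_erase == 0:
        return noisy
    # Randomly pick time steps and erase their features to imitate missing data.
    erase_idx = torch.randperm(seq_len)[:num_erase]
    noisy[erase_idx] = 0.0
    return noisy

# Build an original-noisy instance pair for one modality (illustrative sizes).
original = torch.randn(50, 74)   # e.g., 50 time steps of 74-dim acoustic features
noisy = temporal_feature_erase(original, erase_ratio=0.3)
```

The original-noisy pair produced this way would then be fed through the shared self-attention encoder so that the adversarial and reconstruction objectives can align the two joint representations.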
