Defending Multimodal Fusion Models against Single-Source Adversaries

Karren Yang,Filipe Condessa,Zico Kolter,Manash Barman,Wan-Yi Lin

doi:10.1109/cvpr46437.2021.00335

Abstract

Beyond achieving high performance across many vision tasks, multimodal models are expected to be robust to single-source faults due to the availability of redundant information between modalities. In this paper, we investigate the robustness of multimodal neural networks against worst-case (i.e., adversarial) perturbations on a single modality. We first show that standard multimodal fusion models are vulnerable to single-source adversaries: an attack on any single modality can overcome the correct information from multiple unperturbed modalities and cause the model to fail. This surprising vulnerability holds across diverse multimodal tasks and necessitates a solution. Motivated by this finding, we propose an adversarially robust fusion strategy that trains the model to compare information coming from all the input sources, detect inconsistencies in the perturbed modality compared to the other modalities, and only allow information from the unperturbed modalities to pass through. Our approach significantly improves on state-of-the-art methods in single-source robustness, achieving gains of 7.8-25.2% on action recognition, 19.7-48.2% on object detection, and 1.6-6.7% on sentiment analysis, without degrading performance on unperturbed (i.e., clean) data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Defending Multimodal Fusion Models against Single-Source Adversaries

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Deep Learning with Metadata Augmentation for Classification of Diabetic Retinopathy Level
Maksym Shulha ... Sergii Stirenko
-
Maksym Shulha, et. al.Maksym Shulha ... Sergii Stirenko
01 Jan 2023
01 Jan 2023

Diagnostic efficiency of multi-modal MRI based deep learning with Sobel operator in differentiating benign and malignant breast mass lesions-a retrospective study.
Weixia Tang ... Jiahuan Tang
PeerJ. Computer science | VOL. 9
Weixia Tang, et. al.Weixia Tang ... Jiahuan Tang
17 Jul 2023
PeerJ. Computer science | VOL. 9

Multimodal Deep Learning Network for Differentiating Between Benign and Malignant Pulmonary Ground Glass Nodules.
Gang Liu ... Hui He
Current medical imaging | VOL. -
Gang Liu, et. al.Gang Liu ... Hui He
10 Sep 2024
Current medical imaging | VOL. -

Multimodal deep learning for predicting the choice of cut parameters in the milling process
Cheick Abdoul Kadir A Kounta ... Farid Noureddine
Intelligent Systems with Applications | VOL. 16
Cheick Abdoul Kadir A Kounta, et. al.Cheick Abdoul Kadir A Kounta ... Farid Noureddine
01 Nov 2022
Intelligent Systems with Applications | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Defending Multimodal Fusion Models against Single-Source Adversaries

Abstract

Talk to us

Similar Papers