Mixup Mask Adaptation: Bridging the gap between input saliency and representations via attention mechanism in feature mixup

Minsoo Kang,Minkoo Kang,Seong-Whan Lee,Suhyun Kim

doi:10.1016/j.imavis.2024.105013

Abstract

The inherent complexity and extensive architecture of deep neural networks often lead to overfitting, compromising their ability to generalize to new, unseen data. One of the regularization techniques, data augmentation, is now considered vital to alleviate this, and mixup, which blends pairs of images and labels, has proven effective in enhancing model generalization. Recently, incorporating saliency in mixups has shown performance gains by retaining salient regions in mixed results. While these methods have become mainstream at the input level, their applications at the feature level remain under-explored. Our observations indicate that outcomes from naive applications of input saliency-based methods did not consistently lead to enhancements in performance. In this paper, we attribute these observations primarily to two challenges: ‘Hard Boundary Issue’ and ‘Saliency Mismatch.’ The Hard Boundary Issue describes a situation where masks with distinct, sharp edges work well at the input level, but lead to unintended distortions in the deeper layers. The Saliency Mismatch points to the disparity between saliency masks generated from input images and the saliency of feature maps. To tackle these challenges, we present a novel method called ‘attention-based mixup mask adaptation’ (MMA). This approach employs an attention mechanism to effectively adapt mixup masks, which are designed to maximize saliency at the input level, for feature augmentation purposes. We reduce the Saliency Mismatch problem by incorporating the spatial significance of the feature map into the mixup mask. Additionally, we address the Hard Boundary Issue by applying softmax to smoothen the adjusted mixup mask. Through comprehensive experiments, we validate our observations and confirm the effectiveness of applying MMA to saliency-aware mixup approaches at the feature level, as evidenced by the performance improvements on multiple benchmarks and the robustness improvements against corruption and deformation.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

Mixup Mask Adaptation: Bridging the gap between input saliency and representations via attention mechanism in feature mixup

Abstract

Published Version

Talk to us

Similar Papers

More From: Image and Vision Computing

Lead the way for us

Journal: Image and Vision Computing	Publication Date: Apr 8, 2024
License type: cc-by

Similar Papers

Malaysia traffic sign recognition with convolutional neural network
Mian Mian Lau ... King Hann Lim
-
Mian Mian Lau, et. al.Mian Mian Lau ... King Hann Lim
01 Jul 2015
01 Jul 2015

A Gaussian Mixture Model layer jointly optimized with discriminative features within a Deep Neural Network architecture
Ehsan Variani ... Erik Mcdermott
-
Ehsan Variani, et. al.Ehsan Variani ... Erik Mcdermott
01 Apr 2015
01 Apr 2015

Deep Neural Networks for ECG-Based Pulse Detection during Out-of-Hospital Cardiac Arrest.
Andoni Elola ... Artzai Picón
Entropy (Basel, Switzerland) | VOL. 21
Andoni Elola, et. al.Andoni Elola ... Artzai Picón
21 Mar 2019
Entropy (Basel, Switzerland) | VOL. 21

Multi-Channel Deep Feature Learning for Intrusion Detection
Giuseppina Andresini ... Annalisa Appice
IEEE Access | VOL. 8
Giuseppina Andresini, et. al.Giuseppina Andresini ... Annalisa Appice
01 Jan 2020
IEEE Access | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Mixup Mask Adaptation: Bridging the gap between input saliency and representations via attention mechanism in feature mixup

Abstract

Published Version

Talk to us

Similar Papers

More From: Image and Vision Computing