Nonlinear Mixup: Out-Of-Manifold Data Augmentation for Text Classification

Hongyu Guo

doi:10.1609/aaai.v34i04.5822

Abstract

Data augmentation with Mixup (Zhang et al. 2018) has been shown to be an effective regularizer for state-of-the-art deep classification networks. It generates out-of-manifold samples by linearly interpolating the inputs and corresponding labels of random sample pairs. Despite its success, Mixup requires a convex combination of both the inputs and the modeling targets of a sample pair, which significantly limits the space of its synthetic samples and, consequently, its regularization effect. To address this limitation, we propose “nonlinear Mixup”. Unlike Mixup, where the input and label pairs share the same linear, scalar mixing policy, our approach uses a nonlinear interpolation policy for both the inputs and the labels, where the mixing policy for the labels is adaptively learned from the mixed input. Experiments on benchmark sentence classification datasets indicate that our approach significantly improves upon Mixup. Our empirical studies also show that the out-of-manifold samples generated by our strategy encourage the training samples in each class to form a tight representation cluster that is far from the others.
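For readers unfamiliar with the baseline, the following is a minimal sketch of standard (linear) Mixup as described by Zhang et al. (2018), not of the nonlinear variant proposed here. It assumes inputs are fixed-size arrays (for text, e.g. padded embedding matrices) and labels are one-hot vectors; the function name `mixup_batch` and its parameters are illustrative choices, not taken from the paper.

```python
import numpy as np

def mixup_batch(x, y, alpha=1.0, rng=None):
    """Standard Mixup: one scalar lambda mixes both inputs and labels.

    x: array of shape (batch, ...) holding the inputs.
    y: array of shape (batch, num_classes) holding one-hot labels.
    alpha: Beta distribution parameter (assumed hyperparameter).
    """
    rng = np.random.default_rng() if rng is None else rng
    # A single mixing coefficient in [0, 1], shared by inputs and labels.
    lam = rng.beta(alpha, alpha)
    # Pair each sample with a randomly chosen partner from the same batch.
    perm = rng.permutation(len(x))
    # Convex combination of the inputs and of the labels.
    x_mix = lam * x + (1.0 - lam) * x[perm]
    y_mix = lam * y + (1.0 - lam) * y[perm]
    return x_mix, y_mix, lam
```

Because the same scalar `lam` is applied to every input dimension and to the labels, each synthetic sample lies on the line segment between two training samples; this is the restriction that the abstract describes as limiting the space of synthetic samples.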

Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Apr 3, 2020
Citations: 45
