Abstract

Test-time augmentation (TTA) is a widely adopted technique in computer vision: it improves a model's predictive performance by aggregating the predictions of multiple augmented test samples, without additional training or hyperparameter tuning. While prior work has demonstrated the effectiveness of TTA on visual tasks, applying it to natural language processing (NLP) remains challenging because of complications such as varying text lengths, the discreteness of word tokens, and missing word tokens. These factors make it difficult for standard TTA to preserve label invariance in augmented text samples. This paper therefore proposes Defy, a novel TTA technique that combines a nearest-neighbor anomaly detection algorithm and an adaptive weighting network architecture with a bidirectional KL-divergence regularization term between the original sample's prediction and the aggregated prediction, encouraging the model to make more consistent and reliable predictions across the various augmented samples. Through comparison with Defy, the paper also examines how common TTA methods can impair the semantic meaning of a text during augmentation, shifting the model's prediction from correct to incorrect. Extensive experiments demonstrate that Defy consistently outperforms existing TTA methods on a variety of text classification tasks and brings consistent improvements across different mainstream models.
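The bidirectional KL-divergence term mentioned above can be illustrated with a minimal sketch. The function names and the two-logit-vector interface here are assumptions for illustration, not the paper's actual implementation: the regularizer is taken to be the symmetrized KL divergence between the class distribution predicted for the original sample and the distribution obtained by aggregating predictions over augmented samples.

```python
import math

def softmax(logits):
    """Convert a list of logits into a probability distribution."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def kl(p, q):
    """KL(p || q) for two discrete distributions over the same classes."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def bidirectional_kl(orig_logits, agg_logits):
    """Symmetrized KL divergence between the original sample's prediction
    and the aggregated augmented prediction (hypothetical sketch of the
    regularization term described in the abstract)."""
    p = softmax(orig_logits)
    q = softmax(agg_logits)
    return 0.5 * (kl(p, q) + kl(q, p))
```

Because the term is symmetrized, it penalizes disagreement in both directions: it is zero only when the original and aggregated predictions coincide, which is what encourages consistency across augmented views.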
