Abstract
Few-shot text classification is a fundamental NLP task in which a model aims to classify text into a large number of categories, given only a few training examples per category. This paper explores data augmentation -- a technique particularly suitable for training with limited data -- for this few-shot, highly-multiclass text classification setting. On four diverse text classification tasks, we find that common data augmentation techniques can improve the performance of triplet networks by up to 3.0% on average. To further boost performance, we present a simple training strategy called curriculum data augmentation, which leverages curriculum learning by first training on only original examples and then introducing augmented data as training progresses. We explore a two-stage and a gradual schedule, and find that, compared with standard single-stage training, curriculum data augmentation trains faster, improves performance, and remains robust to high amounts of noising from augmentation.
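The abstract describes two curriculum schedules: a two-stage schedule (train on originals first, then add augmented data) and a gradual schedule (noise level increases as training progresses). As a rough illustration only, here is a minimal sketch of how such a schedule might build each epoch's training set; the `augment` function is a hypothetical stand-in (a toy word-swap noiser), not the augmentation techniques used in the paper, and the 0.3 strength cap is an arbitrary assumption:

```python
import random

def augment(text, strength):
    """Toy noising: swap a fraction of adjacent word pairs.
    A stand-in for real augmentation (e.g., synonym replacement)."""
    words = text.split()
    n_swaps = int(strength * max(len(words) - 1, 0))
    for _ in range(n_swaps):
        i = random.randrange(max(len(words) - 1, 1))
        words[i], words[i + 1] = words[i + 1], words[i]
    return " ".join(words)

def curriculum_epoch(examples, epoch, total_epochs, schedule="gradual"):
    """Build one epoch's training set under curriculum data augmentation.

    "two-stage": original examples only for the first half of training,
    then originals plus augmented copies at full strength.
    "gradual": augmentation strength ramps up linearly over epochs.
    """
    if schedule == "two-stage":
        strength = 0.3 if epoch >= total_epochs // 2 else 0.0
    else:  # gradual
        strength = 0.3 * epoch / max(total_epochs - 1, 1)
    epoch_set = list(examples)
    if strength > 0:
        # Introduce noised copies only once the curriculum allows it.
        epoch_set += [(augment(text, strength), label) for text, label in examples]
    return epoch_set
```

Under either schedule, early epochs see only clean originals, which is the core idea the abstract contrasts with standard single-stage training on the full augmented set.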
Highlights
Traditional text classification tasks such as sentiment classification (Socher et al., 2013) typically have few output classes, each with many training examples. Many practical scenarios, such as relation classification (Han et al., 2018), answer selection (Kumar et al., 2019), and sentence clustering (Mnasri et al., 2017), have the converse setup, characterized by a large number of output classes (Gupta et al., 2014), often with few training examples per class
We hypothesize that the few-shot, highly-multiclass text classification scenario is a suitable context for data augmentation
We propose a simple training strategy called curriculum data augmentation; applying curriculum learning in NLP applications can be challenging due to the scarcity of training data
Summary
Traditional text classification tasks such as sentiment classification (Socher et al., 2013) typically have few output classes (e.g., two in binary classification), each with many training examples. Many practical scenarios, such as relation classification (Han et al., 2018), answer selection (Kumar et al., 2019), and sentence clustering (Mnasri et al., 2017), have the converse setup, characterized by a large number of output classes (Gupta et al., 2014), often with few training examples per class. This scenario, which we refer to as few-shot, highly-multiclass text classification, is a suitable context for data augmentation: performance improvements from augmentation can be marginal when training data is sufficient, and augmentation is especially beneficial in limited-data scenarios (Xie et al., 2020).
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 5493–5500