Self-supervised Regularization for Text Classification

Meng Zhou,Pengtao Xie,Zechen Li

doi:10.1162/tacl_a_00389

Abstract

AbstractText classification is a widely studied problem and has broad applications. In many real-world problems, the number of texts for training classification models is limited, which renders these models prone to overfitting. To address this problem, we propose SSL-Reg, a data-dependent regularization approach based on self-supervised learning (SSL). SSL (Devlin et al., 2019a) is an unsupervised learning approach that defines auxiliary tasks on input data without using any human-provided labels and learns data representations by solving these auxiliary tasks. In SSL-Reg, a supervised classification task and an unsupervised SSL task are performed simultaneously. The SSL task is unsupervised, which is defined purely on input texts without using any human- provided labels. Training a model using an SSL task can prevent the model from being overfitted to a limited number of class labels in the classification task. Experiments on 17 text classification datasets demonstrate the effectiveness of our proposed method. Code is available at https://github.com/UCSD-AI4H/SSReg.

Highlights

Text classification (Korde and Mahender, 2012; Lai et al, 2015; Wang et al, 2017; Howard and Ruder, 2018) is a widely studied problem in natural language processing and finds broad applications
To address overfitting problems in text classification, we propose a data-dependent regularizer called SSL-Reg based on self-supervised learning (SSL) (Devlin et al, 2019a; He et al, 2019; Chen et al, 2020) and use it to regularize the training of text classification models, where a supervised classification task and an unsupervised SSL task are performed simultaneously
We propose to use self-supervised learning to alleviate overfitting in text classification problems

Summary

Introduction

Text classification (Korde and Mahender, 2012; Lai et al, 2015; Wang et al, 2017; Howard and Ruder, 2018) is a widely studied problem in natural language processing and finds broad applications. Give clinical notes of a patient, judge whether this patient has heart diseases. In many real-world text classification problems, texts available for training are oftentimes limited. It is difficult to obtain a lot of clinical notes from hospitals due to concern of patient privacy. It is well known that when training

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Transactions of the Association for Computational Linguistics	Publication Date: Jul 8, 2021
Citations: 5	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Self-supervised Regularization for Text Classification

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Transactions of the Association for Computational Linguistics

Lead the way for us

Similar Papers

Self-supervised Regularization for Text Classification
...
-
, et. al. ...
25 May 2021
25 May 2021

Unsupervised and self-supervised deep learning approaches for biomedical text mining.
Mohamed Nadif ... François Role
Briefings in bioinformatics | VOL. 22
Mohamed Nadif, et. al.Mohamed Nadif ... François Role
11 Feb 2021
Briefings in bioinformatics | VOL. 22

A Novel Multi-Task Self-Supervised Representation Learning Paradigm
Yinggang Li ... Junwei Hu
Control theory & applications | VOL. -
Yinggang Li, et. al.Yinggang Li ... Junwei Hu
28 May 2021
Control theory & applications | VOL. -

Exploring PolSAR Images Representation via Self-Supervised Learning and Its Application on Few-Shot Classification
Wu Zhang ... Zongxu Pan
IEEE Geoscience and Remote Sensing Letters | VOL. 19
Wu Zhang, et. al.Wu Zhang ... Zongxu Pan
01 Jan 2021
IEEE Geoscience and Remote Sensing Letters | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Self-supervised Regularization for Text Classification

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Transactions of the Association for Computational Linguistics