Multi-Task Learning in Natural Language Processing: An Overview

Shijie Chen,Qiang Yang,Yu Zhang

doi:10.1145/3663363

Abstract

Deep learning approaches have achieved great success in the field of Natural Language Processing (NLP). However, directly training deep neural models often suffer from overfitting and data scarcity problems that are pervasive in NLP tasks. In recent years, Multi-Task Learning (MTL), which can leverage useful information of related tasks to achieve simultaneous performance improvement on these tasks, has been used to handle these problems. In this article, we give an overview of the use of MTL in NLP tasks. We first review MTL architectures used in NLP tasks and categorize them into four classes, including parallel architecture, hierarchical architecture, modular architecture, and generative adversarial architecture. Then we present optimization techniques on loss construction, gradient regularization, data sampling, and task scheduling to properly train a multi-task model. After presenting applications of MTL in a variety of NLP tasks, we introduce some benchmark datasets. Finally, we make a conclusion and discuss several possible research directions in this field.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: ACM Computing Surveys	Publication Date: Jul 25, 2024
Citations: 2	License type: mit

R Discovery Prime

R Discovery Prime

Multi-Task Learning in Natural Language Processing: An Overview

Abstract

Talk to us

Similar Papers

More From: ACM Computing Surveys

Lead the way for us

Similar Papers

Multitask Learning as Question Answering with BERT
Shishir Roy ... Nayeem Ehtesham
-
Shishir Roy, et. al.Shishir Roy ... Nayeem Ehtesham
18 Dec 2021
18 Dec 2021

MaChAmp at SemEval-2022 Tasks 2, 3, 4, 6, 10, 11, and 12: Multi-task Multi-lingual Learning for a Pre-selected Set of Semantic Datasets

-

09 Jul 2022
MaChAmp at SemEval-2022 Tasks 2, 3, 4, 6, 10, 11, and 12: Multi-task Multi-lingual Learning for a Pre-selected Set of Semantic Datasets

Keynote - AI for the Public Sector and the Case of Legal NLP
Matthias Stürmer
-
Matthias StürmerMatthias Stürmer
03 Apr 2023
03 Apr 2023

Using Eye-tracking Data to Predict the Readability of Brazilian Portuguese Sentences in Single-task, Multi-task and Sequential Transfer Learning Approaches
Sidney Evaldo Leal ... Elisângela Nogueira Teixeira
-
Sidney Evaldo Leal, et. al.Sidney Evaldo Leal ... Elisângela Nogueira Teixeira
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-Task Learning in Natural Language Processing: An Overview

Abstract

Talk to us

Similar Papers

More From: ACM Computing Surveys