Abstract

Transfer learning is a technique used in deep learning to transfer representations learned on one task to a different target domain. It primarily addresses the problem of small training datasets, which leads to model overfitting and degrades performance. This study reviewed publications retrieved from digital libraries including SCOPUS, ScienceDirect, IEEE Xplore, the ACM Digital Library, and Google Scholar, which formed the primary studies; secondary studies were identified from the primary articles through backward and forward snowballing. Relevant publications were then selected for review against predefined inclusion and exclusion criteria. The review focuses on transfer learning with pretrained NLP models built on the deep transformer network. BERT and GPT are the two leading pretrained models, trained through self-supervised learning on large unlabeled text corpora to capture global and local representations. Pretrained transformer models offer numerous advantages to natural language processing, such as transferring knowledge to downstream tasks and thereby avoiding the drawbacks of training a model from scratch. This review gives a comprehensive view of the transformer architecture, self-supervised learning and pretraining concepts in language models, and their adaptation to downstream tasks. Finally, we present future directions for further improving pretrained transformer-based language models.
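To make the notion of adapting a pretrained model to a downstream task concrete, the sketch below (an illustrative assumption, not material from the reviewed studies) loads a pretrained BERT encoder through the Hugging Face transformers library and performs one fine-tuning step on a small binary classification batch; the model name, labels, and hyperparameters are placeholders.

```python
# Minimal sketch: adapting a pretrained transformer to a downstream
# classification task. Model name, labels, and learning rate are
# illustrative assumptions, not values from the reviewed studies.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2  # pretrained encoder + new task head
)

# A tiny labeled batch for the downstream task.
texts = ["The movie was great.", "The plot made no sense."]
labels = torch.tensor([1, 0])
batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")

# One fine-tuning step: the classification head is newly initialized, while
# the encoder weights carry knowledge from self-supervised pretraining.
optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)
outputs = model(**batch, labels=labels)
outputs.loss.backward()
optimizer.step()
```

In practice, this loop would run over a full labeled dataset; the point of the sketch is that only a small task-specific head is trained from scratch, which is why pretrained transformers mitigate the small-dataset overfitting problem the review describes.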
