A Historical Survey of Advances in Transformer Architectures

Ali Reza Sajun, Imran Zualkernan, Donthi Sankalpa

doi:10.3390/app14104316

Abstract

Transformer-based deep learning models have recently risen to prominence in machine learning for a variety of tasks, such as computer vision and text generation. Given this increased interest, a historical look at the development and rapid progression of transformer-based models is needed to understand the rise of this key architecture. This paper presents a survey of key works related to the early development and implementation of transformer models in various domains, such as generative deep learning and as backbones of large language models. Previous works are classified based on their historical approaches, followed by key works in the domains of text-based applications, image-based applications, and miscellaneous applications. A quantitative and qualitative analysis of the various approaches is presented. Additionally, recent directions of transformer-related research, such as those in the biomedical and time-series domains, are discussed. Finally, future research opportunities, especially regarding multi-modality and the optimization of the transformer training process, are identified.

Journal: Applied Sciences
Publication Date: May 20, 2024
Citations: 1
License type: CC BY 4.0

Similar Papers

Topic-Controlled Text Generation
Cansen Caglayan ... Murat Karakaya
15 Sep 2021

A New Human Factor Study in Developing Practical Vision-Based Applications with the Transformer-Based Deep Learning Model
Thitirat Siriborvornratanakul
01 Jan 2021

Real time Semantic Segmentation for Human-Labeled Data: A comparative Study Between CNN and Transformer
Connie Chen
Journal of Student Research | VOL. 13
29 Feb 2024

Modeling language and cognition with deep unsupervised learning: a tutorial overview.
Marco Zorzi ... Alberto Testolin
Frontiers in Psychology | VOL. 4
01 Jan 2013
