A Generalized Attention Mechanism to Enhance the Accuracy Performance of Neural Networks.

Pengcheng Jiang, Ferrante Neri, Yu Xue, Ujjwal Maulik

doi:10.1142/s0129065724500631

Abstract

In many modern machine learning (ML) models, attention mechanisms (AMs) play a crucial role in processing data and identifying the significant parts of the inputs, whether these are text or images. This selective focus enables subsequent stages of the model to achieve improved classification performance. Traditionally, AMs are applied as a preprocessing substructure before a neural network, as in encoder/decoder architectures. In this paper, we extend the application of AMs to intermediate stages of data propagation within ML models. Specifically, we propose a generalized attention mechanism (GAM), which can be integrated before each layer of a neural network for classification tasks. At each layer or step of the ML architecture, the proposed GAM identifies the most relevant sections of the intermediate results. Our experimental results demonstrate that incorporating the proposed GAM into various ML models consistently enhances their accuracy. This improvement is achieved with only a marginal increase in the number of parameters, which does not significantly affect the training time.
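The abstract does not give the form of the GAM, so the following is only a minimal sketch of the general idea of placing an attention step before every layer, not the authors' method. It assumes a hypothetical per-feature gate (`AttentionGate`) that scores each entry of the intermediate result, normalizes the scores with a softmax, and reweights the entries before a plain dense layer. The class names, the gate's scoring rule, and the layer sizes are all illustrative assumptions.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class AttentionGate:
    """Hypothetical gate (an assumption, not the paper's GAM):
    one score weight and one score bias per input feature."""

    def __init__(self, size, rng):
        self.weight = [rng.uniform(-0.1, 0.1) for _ in range(size)]
        self.bias = [0.0] * size

    def __call__(self, x):
        scores = [w * v + b for w, v, b in zip(self.weight, x, self.bias)]
        attention = softmax(scores)
        # Rescale by the feature count so uniform attention leaves x unchanged.
        return [len(x) * a * v for a, v in zip(attention, x)]

    def parameter_count(self):
        return len(self.weight) + len(self.bias)


class Dense:
    """Plain fully connected layer with an optional ReLU."""

    def __init__(self, n_in, n_out, rng, relu=True):
        scale = 1.0 / math.sqrt(n_in)
        self.weights = [[rng.gauss(0.0, scale) for _ in range(n_in)]
                        for _ in range(n_out)]
        self.bias = [0.0] * n_out
        self.relu = relu

    def __call__(self, x):
        out = [sum(w * v for w, v in zip(row, x)) + b
               for row, b in zip(self.weights, self.bias)]
        return [max(0.0, o) for o in out] if self.relu else out

    def parameter_count(self):
        return len(self.bias) * (len(self.weights[0]) + 1)


class GatedMLP:
    """Classifier sketch with an attention gate in front of every layer."""

    def __init__(self, sizes, seed=0):
        rng = random.Random(seed)
        last = len(sizes) - 2
        self.gates = [AttentionGate(n, rng) for n in sizes[:-1]]
        self.layers = [Dense(a, b, rng, relu=(i != last))
                       for i, (a, b) in enumerate(zip(sizes, sizes[1:]))]

    def forward(self, x):
        for gate, layer in zip(self.gates, self.layers):
            x = layer(gate(x))  # attend to the intermediate result, then transform
        return x  # unnormalized class scores


model = GatedMLP([4, 8, 3])
logits = model.forward([0.5, -1.0, 2.0, 0.0])
gate_params = sum(g.parameter_count() for g in model.gates)
dense_params = sum(d.parameter_count() for d in model.layers)
```

In this sketch each gate adds a number of parameters linear in the layer width, while each dense layer grows with the product of its input and output widths. For the toy sizes above the gates hold 24 parameters against 67 in the dense layers; the relative overhead shrinks as layers get wider, which is one way a per-layer attention step could stay cheap, though the paper's actual parameter accounting may differ.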

More From: International journal of neural systems

Similar Papers

Scientific Inference with Interpretable Machine Learning: Analyzing Models to Learn About Real-World Phenomena
Timo Freiesleben ... Álvaro Tejero-Cantero
Minds and Machines | VOL. 34
15 Jul 2024

Detecting APS failures using LSTM-AE and anomaly transformer enhanced with human expert analysis
Mehmet E Mumcuoglu ... Kerem Koprubasi
Engineering Failure Analysis | VOL. 165
23 Aug 2024

Optimal Donor Selection for Hematopoietic Cell Transplantation Using Bayesian Machine Learning.
Brent R Logan ... Purushottam W Laud
JCO Clinical Cancer Informatics | VOL. 5
01 Dec 2021

LO22: Risk-stratification of emergency department syncope by artificial intelligence using machine learning: human, statistics or machine
L Grant ... P Joo
CJEM | VOL. 22
01 May 2020
