Multiple Positional Self-Attention Network for Text Classification

Biyun Dai,Jinlong Li,Ruoyi Xu

doi:10.1609/aaai.v34i05.6261

Abstract

Self-attention mechanisms have recently caused many concerns on Natural Language Processing (NLP) tasks. Relative positional information is important to self-attention mechanisms. We propose Faraway Mask focusing on the (2m + 1)-gram words and Scaled-Distance Mask putting the logarithmic distance punishment to avoid and weaken the self-attention of distant words respectively. To exploit different masks, we present Positional Self-Attention Layer for generating different Masked-Self-Attentions and a following Position-Fusion Layer in which fused positional information multiplies the Masked-Self-Attentions for generating sentence embeddings. To evaluate our sentence embeddings approach Multiple Positional Self-Attention Network (MPSAN), we perform the comparison experiments on sentiment analysis, semantic relatedness and sentence classification tasks. The result shows that our MPSAN outperforms state-of-the-art methods on five datasets and the test accuracy is improved by 0.81%, 0.6% on SST, CR datasets, respectively. In addition, we reduce training parameters and improve the time efficiency of MPSAN by lowering the dimension number of self-attention and simplifying fusion mechanism.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multiple Positional Self-Attention Network for Text Classification

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Apr 3, 2020
Citations: 4

Similar Papers

Complex-Valued Relative Positional Encodings for Transformer
Gang Yang ... Hongzhe Xu
-
Gang Yang, et. al.Gang Yang ... Hongzhe Xu
24 Feb 2023
24 Feb 2023

Efficient utilization of pre-trained models: A review of sentiment analysis via prompt learning
Kun Bu ... Xiaolong Ju
Knowledge-Based Systems | VOL. 283
Kun Bu, et. al.Kun Bu ... Xiaolong Ju
02 Nov 2023
Knowledge-Based Systems | VOL. 283

Natural Language Processing using Deep Learning in Social Media
María Teresa Giménez Fayos
-
María Teresa Giménez FayosMaría Teresa Giménez Fayos
02 Sep 2021
02 Sep 2021

Multi-Task Learning for Semantic Relatedness and Textual Entailment
Linrui Zhang ... Dan Moldovan
Journal of Software Engineering and Applications | VOL. 12
Linrui Zhang, et. al.Linrui Zhang ... Dan Moldovan
01 Jan 2019
Journal of Software Engineering and Applications | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multiple Positional Self-Attention Network for Text Classification

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence