Abstract

Due to the rapid growth of social media content on websites such as Twitter and Facebook, analyzing textual sentiment has become a challenging task, and many studies have therefore focused on textual sentiment analysis. Recently, deep learning models, such as convolutional neural networks and long short-term memory, have achieved promising performance in sentiment analysis and have proven their ability to cope with sequences of arbitrary length. However, when these models are used in the feature extraction layer, the feature space is high-dimensional, the text data are sparse, and equal importance is assigned to all features. To address these issues, we propose a hybrid model that combines a deep neural network with a multi-head attention mechanism (DNN–MHAT). In the DNN–MHAT model, we first design an improved deep neural network that captures the actual context of the text and extracts position-invariant local features by combining bidirectional long short-term memory units (Bi-LSTM) with a convolutional neural network (CNN). Second, we present a multi-head attention mechanism that captures the words in the text connected by long-distance and encoding dependencies, adding a different focus to the information output by the hidden layers of the Bi-LSTM. Finally, global average pooling is applied to transform the vector into a high-level sentiment representation and avoid overfitting, and a sigmoid classifier carries out the sentiment polarity classification of texts. The DNN–MHAT model is tested on four review datasets and two Twitter datasets. The experimental results illustrate the effectiveness of the DNN–MHAT model, which achieves excellent performance on both short tweets and long reviews compared with state-of-the-art baseline methods.
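To make the pipeline concrete, the following is a minimal PyTorch sketch of the architecture described above (embedding → Bi-LSTM → multi-head self-attention over the hidden states → multi-kernel CNN → global average pooling → sigmoid). This is a sketch under stated assumptions, not the paper's implementation: the class name DNNMHATSketch and all layer sizes, head counts, and kernel widths are illustrative choices, not the reported configuration.

```python
import torch
import torch.nn as nn

class DNNMHATSketch(nn.Module):
    """Illustrative sketch of the DNN-MHAT pipeline from the abstract.
    All sizes (vocab, dims, heads, kernels) are assumptions, not the paper's."""

    def __init__(self, vocab_size=20000, embed_dim=128, hidden_dim=64,
                 num_heads=4, kernel_sizes=(3, 4, 5), num_filters=64):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim)
        # Bi-LSTM captures short- and long-range context in both directions.
        self.bilstm = nn.LSTM(embed_dim, hidden_dim,
                              batch_first=True, bidirectional=True)
        # Multi-head self-attention re-weights the Bi-LSTM hidden states so
        # that words tied by long-distance dependencies receive more focus.
        self.mhat = nn.MultiheadAttention(embed_dim=2 * hidden_dim,
                                          num_heads=num_heads,
                                          batch_first=True)
        # Parallel 1-D convolutions with several kernel widths extract
        # position-invariant local features.
        self.convs = nn.ModuleList(
            nn.Conv1d(2 * hidden_dim, num_filters, k, padding=k // 2)
            for k in kernel_sizes)
        # Global average pooling condenses each feature map to one value.
        self.pool = nn.AdaptiveAvgPool1d(1)
        self.classifier = nn.Linear(num_filters * len(kernel_sizes), 1)

    def forward(self, token_ids):                 # (batch, seq_len)
        x = self.embedding(token_ids)             # (batch, seq, embed)
        h, _ = self.bilstm(x)                     # (batch, seq, 2*hidden)
        attended, _ = self.mhat(h, h, h)          # self-attention over h
        c = attended.transpose(1, 2)              # (batch, 2*hidden, seq)
        feats = [self.pool(torch.relu(conv(c))).squeeze(-1)
                 for conv in self.convs]
        z = torch.cat(feats, dim=-1)              # (batch, filters * |K|)
        return torch.sigmoid(self.classifier(z))  # sentiment probability

model = DNNMHATSketch()
probs = model(torch.randint(0, 20000, (2, 40)))   # two toy token sequences
```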

Highlights

  • Sentiment analysis (SA) of text aims to extract and analyze knowledge from the personal information posted on the internet

  • We investigate the effectiveness of the deep neural network (DNN)–multi-head attention (MHAT) model on two types of datasets: long reviews and short tweets from social media

  • The major difference between our model and previous models is that the proposed model considers the following significant features simultaneously: (i) short and long context dependencies, utilizing bidirectional long short-term memory (Bi-LSTM) units; (ii) identifying the most significant features robust to positional changes, utilizing a convolutional neural network (CNN) with various kernels, filter sizes, and pooling mechanisms; (iii) capturing the words in the text connected by long-distance and encoding dependencies, utilizing a multi-head attention mechanism


Introduction

Sentiment analysis (SA) of text aims to extract and analyze knowledge from the personal information posted on the internet. Most previous approaches to SA have trained shallow techniques on carefully engineered features to obtain satisfactory polarity categorization performance [3]. These models occasionally apply traditional classification approaches, including Naïve Bayes, support vector machines (SVM), and latent Dirichlet allocation (LDA), to linguistic properties such as lexical features, part-of-speech (POS) tags, and n-grams. These approaches have two major drawbacks: (1) the feature space on which the model must be trained is high-dimensional and sparse, which degrades model performance; (2) the feature engineering process is time-intensive and laborious.
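For contrast, a shallow baseline of the kind criticized above can be assembled in a few lines with scikit-learn; the TF–IDF n-gram features it produces are exactly the high-dimensional, sparse representation described in drawback (1). This is a minimal sketch: the toy corpus and labels are placeholders, not data from the paper.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Toy corpus; real SA corpora yield feature matrices with 10^5+ columns.
texts = ["great phone, loved it", "terrible battery, awful",
         "works fine", "would not recommend"]
labels = [1, 0, 1, 0]

# Word uni- and bi-grams produce a high-dimensional, sparse feature space,
# and every feature must be hand-chosen via the vectorizer's settings.
clf = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)), LinearSVC())
clf.fit(texts, labels)
print(clf.predict(["loved the battery"]))
```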
