Text Sentiment Polarity Classification Method Based on Word Embedding

Xiaojie Sun,Menghao Du,Hua Shi,Wenming Huang

doi:10.1145/3242840.3242850

Xiaojie Sun, Menghao Du + Show 2 more

https://doi.org/10.1145/3242840.3242850

Copy DOI

Export

Save

Cite

Abstract
Full-Text
Similar Papers

Abstract

Listen

Most of the machine learning algorithms for text sentiment analysis use the word embedding obtained by word2vec training as their inputs. However, the word embedding of word2vec training contains only semantic information. An algorithm for text sentiment analysis is proposed solve the problem of text containing semantics, syntax, sentiment and other information. It begins with the learning of original text-multi word embedding in the semantic, syntactic, and sentiment information, followed by proceeding the word embedding fusion. The improved convolution neural network is applied for sentiment analysis. Thus, it solves the problem that the word embedding contains monotonous text information. K-means text clustering is applied by dividing similar text into the same cluster, thus improving the classification accuracy. The application of the Principal Component Analysis (PCA) dimensionality not only extracts the principal component information, but also solves the problem of redundancy embedding and improves the computational performance of classification model. The experiment results show that the presented method has a significant improvement in the accuracy, recall rate and F value of the sentiment polarity analysis of the critical text in comparison with other fusion algorithms.

Full Text