Determining Fuzzy Membership for Sentiment Classification: A Three-Layer Sentiment Propagation Model.

Chuanjun Zhao,Deyu Li,Suge Wang,Quan Zou

doi:10.1371/journal.pone.0165560

Abstract

Enormous quantities of review documents exist in forums, blogs, twitter accounts, and shopping web sites. Analysis of the sentiment information hidden in these review documents is very useful for consumers and manufacturers. The sentiment orientation and sentiment intensity of a review can be described in more detail by using a sentiment score than by using bipolar sentiment polarity. Existing methods for calculating review sentiment scores frequently use a sentiment lexicon or the locations of features in a sentence, a paragraph, and a document. In order to achieve more accurate sentiment scores of review documents, a three-layer sentiment propagation model (TLSPM) is proposed that uses three kinds of interrelations, those among documents, topics, and words. First, we use nine relationship pairwise matrices between documents, topics, and words. In TLSPM, we suppose that sentiment neighbors tend to have the same sentiment polarity and similar sentiment intensity in the sentiment propagation network. Then, we implement the sentiment propagation processes among the documents, topics, and words in turn. Finally, we can obtain the steady sentiment scores of documents by a continuous iteration process. Intuition might suggest that documents with strong sentiment intensity make larger contributions to classification than those with weak sentiment intensity. Therefore, we use the fuzzy membership of documents obtained by TLSPM as the weight of the text to train a fuzzy support vector machine model (FSVM). As compared with a support vector machine (SVM) and four other fuzzy membership determination methods, the results show that FSVM trained with TLSPM can enhance the effectiveness of sentiment classification. In addition, FSVM trained with TLSPM can reduce the mean square error (MSE) on seven sentiment rating prediction data sets.

Highlights

Following the popularization of forums, blogs, and online shopping websites, amount of usergenerated reviews are growing explosively [1]
As compared with support vector machine (SVM) and four other fuzzy membership determination methods, the experimental results show that fuzzy support vector machine model (FSVM) trained with three-layer sentiment propagation model (TLSPM) can increase the accuracy of sentiment classification
After describing the manner in which the sentiment scores and fuzzy membership of all training documents are obtained by the sentiment network and the sentiment propagation algorithm (SPA), we introduce their usage in sentiment classification by the FSVM

Summary

Introduction

Following the popularization of forums, blogs, and online shopping websites, amount of usergenerated reviews are growing explosively [1]. To get more accurate sentiment classification results, we use the absolute value of sentiment score as the fuzzy membership to train the FSVM. In order to achieve better sentiment classification results, we give higher weights to training samples having a strong sentiment intensity of positive or negative polarity, and lower weights to those having weak sentiment intensity. By using these weighted training samples, a text sentiment classifier of an FSVM can be obtained. As compared with SVM and four other fuzzy membership determination methods, the experimental results show that FSVM trained with TLSPM can increase the accuracy of sentiment classification. FSVM trained with TLSPM can reduce the mean square error (MSE) on seven sentiment rating prediction data sets

Related work

Evaluation metrics

Methods

Conclusions and future work