Abstract

The massive volume of comments on websites and social networks has made it possible to raise awareness of people's beliefs and preferences regarding products and services on a large scale. For this purpose, sentiment analysis, which refers to the determination of the sentiment of texts, has been proposed as an intelligent solution. From a methodological point of view, the recent combination of words embedding and deep neural networks (DNNs) has become an effective approach for sentiment analysis. In Persian studies, formal corpuses such as Wikipedia dumps have been used for word embedding. The fundamental difference between formal and informal texts means that the vectors derived from formal texts in informal contexts such as social networks do not result in desirable accuracy. To overcome this drawback, in this paper, we provide a large integrated text corpus of several different sources of informal comments and we also utilize the Fasttext as the word embedding algorithm. In this research, we use Attention-based LSTM, which has been shown to perform more effectively compared to the similar methods in sentiment analysis for the English language. The proposed method is evaluated on the two Persian “Taaghche” and “Filimo” datasets collected in this paper. The experiments on the two Persian datasets prove that utilizing informal vectors in sentiment analysis and applying the attention model improves the prediction accuracy of the DNN in the sentiment analysis of Persian texts.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.