Abstract

Background/Objectives: Sentiment analysis plays main role in various text mining problems. Although, the Arabic text mining is important especially in the field of sentiment analysis, there is a paucity of research in it, especially, when it plays an important role in different issues in Arabic countries. Arabic language has many dialects that people use to express their feelings in social media. The objective of this study is to perform an experiment that follow the subjective opinion from the text. Subjective Analysis is one way that we can implement to improve the accuracy of the sentiment results in such texts in some dialects, that hide various meanings behind the words such as Saudi dialect. Methods/Statistical analysis: In this study, we manually annotated more than 8,000 tweets to have training and testing data sets with positive or negative words and phrases. Then we proposed a “Bag of Phrases” methodology to analyze the sentiments in the texts, which helped to improve the performance of sentiment analysis. Since using bag of words method is not enough in many cases, we applied a Naive Bayes algorithm to test our method. Findings: The results show that the accuracy of having True positive or True negative is about 84% comparing by using manual annotation process. The accuracy is calculated after taking into consideration the margin of error due to the manual annotation step and subjective interpretation of the texts by the annotators. Novelty/Applications: The novelty of the study is having more accurate training data set comparing with the other works in Saudi dialect for Arabic text, and proposing the BoPh concept.

Highlights

  • Sentiment Analysis has played a major role in achieving different goals in many organizations

  • We explored a very helpful review paper about Arabic Sentiment Analysis ASA (14), and we found couple studies that show a variance in the goals and results

  • Manual annotation was applied to Arabic text data “tweets” for Saudi dialect, and positive or negative sentiments were determined

Read more

Summary

Introduction

Sentiment Analysis has played a major role in achieving different goals in many organizations. Using customer reviews to develop or improve specific services or goods. There are different critical points that must be considered to complete that achievement, such as the source of the collected data, the volume and comprehensive the data, and the ability of processing the data to get some results from it. In (1) the authors mentioned the enormous amount of data that would be produced from the web and social media during the couple of years. Alshammari / Indian Journal of Science and Technology 2020;13(40):4202–4215 be about 40 trillion gigabytes in 2020. The source of the text data might be different, most of them are from the social media applications such as Twitter, Facebook, YouTube and so on(3)

Objectives
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call