Abstract

Social media has recently become a rich resource in mining user sentiments. In this paper, Twitter has been chosen as a platform for opinion mining in trading strategy with Mubasher products, which is a leading stock analysis software provider in the Gulf region. This experiment proposes a model for sentiment analysis of Saudi Arabic (standard and Arabian Gulf dialect) tweets to extract feedback from Mubasher products. A hybrid of natural language processing and machine learning approaches on building models are used to classify tweets according to their sentiment polarity into one of the classes positive, negative and neutral. Firstly, document's Pre-processing are explored on the dataset. Secondly, Naive Bayes and Support Vector Machines (SVMs) are applied with different feature selection schemes like TF-IDF (Term Frequency-Inverse Document Frequency) and BTO (Binary-Term Occurrence). Thirdly, the proposed model for sentiment analysis is expanded to obtain the results for N-Grams term of tokens. Finally, human has labelled the data and this may involve some mistakes in the labelling process. At this moment, neutral class with generalisation of our classification will take results to different classification accuracy.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call