Abstract

Sentiment Analysis technique involves extracting the relevant information from Unstructured User Reviews (UUR) dataset fetched from online and classifying them into appropriate positive and negative comments for making decisions. In UUR, data may be in noisy state, irrelevant features exist which creates high dimensional feature space. To design an effective sentiment learning model, users are required to extract the most relevant sentiment features from UUR. To overcome the issue, we proposed a Linguistic rule based feature selection method for extracting and selecting the sentiment features for Sentiment Analysis as it improves the predictive performance of classification algorithms. The proposed novel feature selection method involves identifying the various sentiment features in the review dataset by using filtering methods such as POS tags, n-grams. In the ensemble model, where the Random Forest classification algorithm is trained for textual sentiment classification, the chosen sentiment feature sets are used. Finally, we test our approach using the real-time review dataset that was collected from a multitude of sources, and the results demonstrate prediction accuracy that is superior to that of existing Sentiment analysis techniques.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.