Abstract

Sentiment analysis is a prominent area of knowledge and pattern mining that deals with identifying and analyzing sentiment in text. The main challenges in sentiment analysis are word ambiguity and multipolarity. Word ambiguity makes it hard to assign polarity, because the polarity of a word depends on its context. In this work, tweets are first preprocessed; the preprocessing includes stop-word removal and lower-case conversion. The tweets are then passed to feature extraction techniques, and the data is split into training and testing sets. The training data is passed to several machine learning algorithms: Naive Bayes, support vector machine, random forest, decision tree, k-NN, and logistic regression. The accuracies obtained with Naive Bayes, support vector machine, random forest, decision tree, k-NN, and logistic regression are 80%, 77%, 72%, 61%, 56%, and 78%, respectively. Naive Bayes achieved the highest accuracy among the algorithms compared.

KEYWORDS: SVM, Naive Bayes, Decision Tree, Random Forest
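The pipeline described above (preprocessing, feature extraction, train/test split, classification) can be illustrated with a minimal, standard-library-only sketch. Several parts are assumptions rather than details from the paper: the abstract does not name its feature extraction technique, so plain bag-of-words counts are used here; the stop-word list, the 75/25 split ratio, and the toy tweets are all hypothetical; and only Naive Bayes is shown, in its multinomial form with Laplace smoothing. The sketch is not expected to reproduce the reported accuracies.

```python
import math
import random
from collections import Counter, defaultdict

# Hypothetical, tiny stop-word list; the abstract does not say which list was used.
STOP_WORDS = {"a", "an", "the", "is", "it", "this", "i", "to", "and", "of"}


def preprocess(tweet):
    """Lower-case the tweet and drop stop words (the two steps the abstract names)."""
    return [tok for tok in tweet.lower().split() if tok not in STOP_WORDS]


def train_test_split(samples, test_ratio=0.25, seed=0):
    """Shuffle and split labelled samples; the 75/25 ratio is an assumption."""
    shuffled = samples[:]
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * (1 - test_ratio))
    return shuffled[:cut], shuffled[cut:]


class NaiveBayes:
    """Multinomial Naive Bayes over bag-of-words counts with Laplace smoothing."""

    def fit(self, samples):
        self.word_counts = defaultdict(Counter)  # label -> token counts
        self.class_counts = Counter()            # label -> number of tweets
        self.vocab = set()
        for text, label in samples:
            tokens = preprocess(text)
            self.class_counts[label] += 1
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(n_docs / total_docs)
            total_words = sum(self.word_counts[label].values())
            for tok in preprocess(text):
                if tok not in self.vocab:
                    continue  # ignore words never seen in training
                count = self.word_counts[label][tok]
                score += math.log((count + 1) / (total_words + len(self.vocab)))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


def accuracy(model, samples):
    """Fraction of samples whose predicted label matches the true label."""
    correct = sum(model.predict(text) == label for text, label in samples)
    return correct / len(samples)


# Made-up tweets for illustration only; not the paper's dataset.
TOY_TWEETS = [
    ("I love this phone", "pos"),
    ("This is a great movie", "pos"),
    ("What a wonderful day", "pos"),
    ("I hate this weather", "neg"),
    ("This is a terrible service", "neg"),
    ("What an awful experience", "neg"),
]

model = NaiveBayes().fit(TOY_TWEETS)
print(model.predict("great phone"))       # pos
print(model.predict("terrible weather"))  # neg
```

On a real labelled tweet corpus, one would fit on the training portion returned by `train_test_split` and report `accuracy` on the held-out portion; the other classifiers mentioned in the abstract could be evaluated the same way for comparison.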

