Abstract

Abstract: In today's digitally-driven landscape, social media platforms such as Twitter, Facebook, and WordPress have emerged as critical conduits for public discourse, resulting in an overwhelming influx of unprocessed data, particularly on Twitter. Coping with the sheer volume of this data presents a significant challenge. To address this, sentiment analysis, a technique that categorizes sentiments into positive, negative, or neutral, offers a promising solution. This study delves into three main approaches for sentiment analysis: Machine learning-based methods, Sentiment lexicon-based approaches, and Hybrid methods. Its primary research objectives revolve around the identification of suitable algorithms and metrics for evaluating the performance of Machine Learning Classifiers. Additionally, the study aims to compare these metrics with respect to the dataset size, gauging their impact on the most appropriate sentiment analysis algorithm.The research methodology applied is experimental, entailing a rigorous assessment of algorithms using carefully selected metrics. The results of this investigation spotlight Naïve Bayes, Random Forest, XGBoost, and CNN-LSTM as the leading machine learning algorithms under consideration. These algorithms are assessed based on key performance metrics such as precision, accuracy, F1 score, and recall. Notably, the CNN-LSTM model emerges as the optimal choice for sentiment analysis of Twitter data within the specified dataset size, achieving a remarkable accuracy rate of 88%.In summary, this research successfully pinpoints the most suitable algorithm for sentiment analysis of Twitter data, especially in the context of dataset size. The CNN- LSTM model showcases its effectiveness, serving as a robust tool for sentiment analysis and delivering an impressive accuracy rate. This study significantly enhances our comprehension of public sentiment on Twitter, providing valuable insights into the ever-evolving realm of digital discourse and the analysis of vast unprocessed data

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call