The use of social media platforms, such as Twitter, has grown exponentially over the years, and it has become a valuable source of information for various fields, including marketing, politics, and finance. Sentiment analysis is particularly relevant in social media analysis. Sentiment analysis involves the use of natural language processing (NLP) techniques to automatically determine the sentiment expressed in a given text, such as positive, negative, or neutral.
 In this research paper, we focus on Twitter sentiment analysis and identify the most influential users in a given topic. We propose a methodology based on machine learning techniques to perform sentiment analysis and identify the most influential users on Twitter based on popularity. Specifically, we utilize a combination of NLP techniques, sentiment lexicons, and machine learning algorithms to classify tweets as positive, negative, or neutral. We then employ popularity calculations for each user to identify the top 10 most influential users on a given topic.
 The proposed methodology was tested on a large dataset of US airlines tweets which is related to a specific topic i.e. airlines, and the results show that the approach can effectively classify tweets according to sentiment and identify the most influential users. We evaluated the performance of several machine learning algorithms, including Multinomial Naive Bayes, Support Vector Machines (SVM), Decision Trees, Gradient Boosting, logistic regression, AdaBoost, KNN and Random Forest, and found that the logistic regression algorithm has achieved the highest accuracy.
 The proposed methodology has several implications for various fields, such as marketing, where sentiment analysis can help companies understand consumer behavior and tailor their marketing strategies accordingly. Moreover, identifying the most influential users can provide insights into opinion leaders in a given topic and help companies and policymakers target their messages more effectively.
Read full abstract