Abstract

Facebook and Twitter, as two known social media become popular sources of big data that give the right to people to share and express their feedback about products, services, politicians, events, and every aspect of life in the form of short texts. The classification of sentiments could be automated through machine learning and enhanced using appropriate feature extraction methods. In this work, we collected the most recent tweets about (Biden, Benzema, Apple, and NASA) using Twitter-API and assigned sentiment scores using a rule-based lexicon approach; after pre-processing stage, each dataset is divided into 80% as a training set, and rest 20% as testing set. After that, the Distributed bag of words, Distributed memory mean, Distributed Memory Concatenation, and Term Frequency-Inverse Document Frequency models are used for feature extraction from pre-processed tweets. Depending on the Shark smell optimizer algorithm, the SVM technique was used to classify the extracted features. The SSO was used to tune and select the best value for SVM parameters to optimize the overall model performance. The results display that these optimizers have an essential impact on increasing the model accuracy. After optimization, the model accuracy reached 92.12%, while the highest accuracy without optimization was 88.69% for various feature extraction methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.