Sentiment analysis is a computational method for determining the emotional tone or attitude of a text, such as a review, tweet, or news story. Using machine learning models, deep neural networks, and natural language processing, it examines the text to determine whether it expresses positive or negative sentiment. This study applies Naive Bayes, Logistic Regression, LSVM, Decision Tree, LSTM, and BiLSTM models to sentiment analysis (SA) on the IMDB dataset. The goal is to evaluate how well these models classify movie reviews as positive or negative. The study also investigates the effects of data pre-processing methods and hyperparameter tuning on the models’ accuracy. The results show that the BiLSTM model outperforms the other models in recall, precision, and accuracy, followed by the LSTM, Logistic Regression, LSVM, Decision Tree, and Naive Bayes models. The research highlights the potential of deep learning models, BiLSTM in particular, for sentiment analysis tasks, as well as the importance of hyperparameter tuning and pre-processing in achieving high accuracy.