Evolutionary Multiobjective Feature Selection for Sentiment Analysis

Ayca Deniz, Pelin Angin, Merih Angin

doi:10.1109/access.2021.3118961

Open Access

https://doi.org/10.1109/access.2021.3118961

Journal: IEEE Access
Publication Date: Jan 1, 2021
Citations: 13
License type: CC BY 4.0

Affiliation: Middle East Technical University

Abstract

Sentiment analysis is one of the prominent research areas in data mining and knowledge discovery, which has proven to be an effective technique for monitoring public opinion. The big data era with a high volume of data generated by a variety of sources has provided enhanced opportunities for utilizing sentiment analysis in various domains. In order to take best advantage of the high volume of data for accurate sentiment analysis, it is essential to clean the data before the analysis, as irrelevant or redundant data will hinder extracting valuable information. In this paper, we propose a hybrid feature selection algorithm to improve the performance of sentiment analysis tasks. Our proposed sentiment analysis approach builds a binary classification model based on two feature selection techniques: an entropy-based metric and an evolutionary algorithm. We have performed comprehensive experiments in two different domains using a benchmark dataset, Stanford Sentiment Treebank, and a real-world dataset we have created based on World Health Organization (WHO) public speeches regarding COVID-19. The proposed feature selection model is shown to achieve significant performance improvements in both datasets, increasing classification accuracy for all utilized machine learning and text representation technique combinations. Moreover, it achieves over 70% reduction in feature size, which provides efficiency in computation time and space.
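
As a rough illustration of the entropy-based filter step mentioned in the abstract, the following minimal sketch scores each term by information gain with respect to a binary sentiment label. The toy documents, the labels, and the helper names (`entropy`, `information_gain`, `rank_terms`) are assumptions made for this example and are not taken from the paper; the binary term-presence representation is likewise only one possible choice.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def information_gain(feature_present, labels):
    """IG = H(labels) - H(labels | feature) for a binary presence feature."""
    total = len(labels)
    with_f = [y for x, y in zip(feature_present, labels) if x]
    without_f = [y for x, y in zip(feature_present, labels) if not x]
    conditional = ((len(with_f) / total) * entropy(with_f)
                   + (len(without_f) / total) * entropy(without_f))
    return entropy(labels) - conditional


def rank_terms(docs, labels):
    """Rank vocabulary terms by information gain, highest first."""
    vocab = sorted({t for d in docs for t in d.split()})
    scores = {t: information_gain([t in d.split() for d in docs], labels)
              for t in vocab}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical toy corpus: 1 = positive, 0 = negative.
docs = ["good movie", "bad movie", "good plot", "bad plot"]
labels = [1, 0, 1, 0]
ranking = rank_terms(docs, labels)
```

In this toy corpus, "good" and "bad" separate the two classes perfectly and receive the highest score, while "movie" and "plot" occur equally in both classes and score zero. A filter step would keep only the top-ranked terms before any wrapper-style search.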

Highlights

The significant advances in data storage, communication and processing technologies in recent years have given rise to the big data era, with a plethora of information flowing in from various data sources at high speeds
One of the main challenges in sentiment classification is the high amount of data that contain irrelevant or redundant features [27], which adversely affect the performance of machine learning models [28]
In this paper, we proposed a hybrid multiobjective feature selection algorithm to improve the performance of the sentiment classification task in various domains

Summary

INTRODUCTION

The significant advances in data storage, communication and processing technologies in recent years have given rise to the big data era, with a plethora of information flowing in from various data sources at high speeds. One of the main challenges in sentiment classification is the large amount of data containing irrelevant or redundant features [27], which adversely affect the performance of machine learning models [28]. Although there exist feature selection methods that combine filter-based and wrapper-based approaches for sentiment analysis [36], [37], all of them approach the problem from a single-objective perspective. We propose a new hybrid multiobjective feature selection model for the sentiment analysis task, which harnesses the power of an entropy-based metric, i.e., Information Gain, and an evolutionary algorithm, i.e., Nondominated Sorting Genetic Algorithm II (NSGA-II). Experiments with different machine learning and feature extraction techniques on the well-known Stanford Sentiment Treebank dataset demonstrate that our proposed model improves the learning performance of the sentiment analysis task considerably.
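
To make the multiobjective idea concrete, the sketch below shows Pareto dominance and the extraction of the first nondominated front, which is the ranking concept at the core of NSGA-II. It is not the full algorithm (there is no crowding distance, crossover, or mutation), and it assumes two objectives, maximizing classification accuracy and minimizing the number of selected features, which is an assumption based on the abstract rather than a statement of the paper's exact objective functions. The candidate values are made-up toy numbers, not results from the paper.

```python
def dominates(a, b):
    """Return True if candidate a Pareto-dominates candidate b.

    Each candidate is a tuple (accuracy, n_features):
    accuracy is maximized, n_features is minimized (assumed objectives).
    """
    acc_a, n_a = a
    acc_b, n_b = b
    no_worse = acc_a >= acc_b and n_a <= n_b
    strictly_better = acc_a > acc_b or n_a < n_b
    return no_worse and strictly_better


def first_front(candidates):
    """Return the candidates that no other candidate dominates."""
    return [c for c in candidates
            if not any(dominates(other, c) for other in candidates)]


# Hypothetical feature subsets summarized as (accuracy, number of features).
candidates = [(0.80, 120), (0.82, 300), (0.78, 150), (0.82, 90), (0.75, 40)]
front = first_front(candidates)
```

Here the front contains two trade-off solutions: (0.82, 90), which is the most accurate, and (0.75, 40), which is the smallest. A single-objective method would return only one of them, whereas a multiobjective search keeps the whole set of trade-offs for the user to choose from.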

RELATED WORK

PROPOSED MODEL

EXPERIMENT RESULTS

Findings

CONCLUSION
