Aspect Based Multi-Labeling Using SVM Based Ensembler

Khursheed Aurangzeb,Musaed Alhussein,Nasir Ayub

doi:10.1109/access.2021.3055768

Abstract

Sentiment analysis is one of the most prominent sub-areas of research in Natural Language Processing (NLP), where it is important to consider implicit or explicit emotions conveyed by review material. Researchers also recognized that the generic feelings derived from the textual material are insufficient, so the sentiment analysis aspect based was coined to extract the emotions from textual data. Multi-labeling based on aspects data can resolve the issue of extracting emotion aspect based. In this work, a novel approach namely: Evolutionary Ensembler (EEn) is proposed to effectively boost the accuracy and diversity of multi-label learners. Unlike traditional multi-label training methods, EEn emphasizes the accuracy and diversity of multi-label-based models. We have used seven datasets (medical, hotel, movies, automobiles, proteins, birds, emotions, news). At first, we applied a pre-processing step to gain the refined and clean data. Second, we have applied the Vader tool with Bag of Words (BoW) for the feature extraction. Third, the word2vec method is applied to draw an association between words. Moreover, the SVM model (tuned with GA) is trained and tested on the refined data. The accuracy of the aspect-based multi-labeling using the SVM-GA on medical, hotel, movies, automobiles, proteins, birds, emotions, news are 93.13%, 94.32%, 94.0%, 95.10%, 90.20%, 93.22%, 90.0%, and 94.0%, respectively. Our proposed model with different dimensions of multi-label datasets shows that EEn is vastly superior to other popular techniques. Experimental outcomes validate the success of the implemented approach among existing benchmark techniques.

Highlights

Sentiment analysis is an active research field that is being applied to different applications, i.e., understanding consumer’s feedbacks and public perceptions, and tracking real-world events [1]
The relation among words needs to be precisely defined and the word2vec framework offers three million words mapped with the same amount of phrases and words, it is quite possible to have a suitable relationship among words from such a large repository
A novel assembler based on an optimization algorithm has been proposed to tackle this issue

Summary

INTRODUCTION

Sentiment analysis is an active research field that is being applied to different applications, i.e., understanding consumer’s feedbacks and public perceptions, and tracking real-world events [1]. It is well familiar that by building multi-base learners in the single-label context, ensemble-learning enhances generalization capacity of the learning system and decreases the risk of over-fitting [12]. In the case of ML learning, the generalization potential of the ML learning system can be greatly enhanced, if we put together a cluster of simple multi-label learners to predict all the respective labels. This is known as the MLL ensemble [13]. We propose a novel way to utilize aspect-based multi-label classification for sentiment classification of emotions in textual data. We compare 5 state of the art multi-label classification methods with 8 emotion-based textual data-sets.

RELATED WORK

DATA PREPROCESSING

FEATURES EXTRACTION

SVM CLASSIFICATION FRAMEWORK

MULTI-OBJECTIVE OPTIMIZATION SOLUTION

35: Statistical analysis

CONCLUSION

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE access : practical innovations, open solutions	Publication Date: Jan 1, 2021
Citations: 51	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Aspect Based Multi-Labeling Using SVM Based Ensembler

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE access : practical innovations, open solutions

Lead the way for us

Similar Papers

From semantics to pragmatics: where IS can lead in Natural Language Processing (NLP) research
Yan Li ... Dapeng Liu
European Journal of Information Systems | VOL. 30
Yan Li, et. al.Yan Li ... Dapeng Liu
24 Sep 2020
European Journal of Information Systems | VOL. 30

Anaphora and coreference resolution: A review
Rhea Sukthanker ... Ramkumar Thirunavukarasu
Information Fusion | VOL. 59
Rhea Sukthanker, et. al.Rhea Sukthanker ... Ramkumar Thirunavukarasu
01 Feb 2020
Information Fusion | VOL. 59

A two-site survey of medical center personnel\u2019s willingness to share clinical data for research: implications for reproducible health NLP research
Chunhua Weng ... Casey A. Rommel
BMC medical informatics and decision making | VOL. 19
Chunhua Weng, et. al.Chunhua Weng ... Casey A. Rommel
01 Apr 2019
BMC medical informatics and decision making | VOL. 19

Natural Language Processing and Computational Linguistics
Junichi Tsujii
Computational Linguistics | VOL. -
Junichi TsujiiJunichi Tsujii
07 Dec 2021
Computational Linguistics | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Aspect Based Multi-Labeling Using SVM Based Ensembler

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE access : practical innovations, open solutions